GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



March 1, 2005, 08:40:57 ; Search time 205.443 Seconds 

(without alignments) 
1690.542 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-624-932-2 
4791 

1 MAVRPGLWPALLGIVLAAWL AVAGL GQ P DAGL FT VS EAEC 898 



Scoring table: 
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Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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RESULT 1 
AAU85403 

ID AAU85403 standard; protein; 898 AA. 
XX 

AC AAU85403; 
XX 

DT 21-MAY-2002 (first entry) 
XX 

DE Human protein N0V1. 
XX 

KW Human; N0VX; cardiomyopathy; atherosclerosis; diabetes; 

KW cell signal processing disorder; metabolic disorder; obesity; infection; 

KW anorexia; cancer-associated cachexia; cancer; neurodegenerative disorder; 

KW Alzheimer's disease; Parkinson f s disease; immune disorder; 

KW haematopoietic disorders; dyslipidaemia; pain; asthma; hypertension; 

KW osteoporosis; Crohn's disease; multiple sclerosis; angina pectoris; 



KW myocardial infarction; ulcer; allergy; benign prostatic hypertrophy; 

KW psychosis; neurological disorder; anxiety; schizophrenia; 

KW manic depression; dementia; dyskinesia; Huntington's disease; 

KW Gilles de la Tourette ' s syndrome; gene therapy. 

XX 

OS Homo sapiens . 
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PN WO200210216-A2. 
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XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Padigaru M, Mezes P, Mishra V, Burgess C, Casman S, Grosse WM; 

PI Alsobrook JP, Lepley DM, Gerlach VL, Macdougall JR f Smithson G; 
XX 

DR WPI; 2002-180074/23. 

DR N-PSDB; ABK37922. 
XX 

PT New isolated cytoplasmic, nuclear, membrane bound, or secreted 

PT polypeptide, useful for treating cardiomyopathy, atherosclerosis, 

PT infections, cancer, neurodegenerative, metabolic, hematopoietic and 

PT immune disorders. 
XX 

PS Claim 1; Page 11; 213pp; English. 
XX 

CC The invention relates to an isolated cytoplasmic, nuclear, membrane 

CC bound, or secreted polypeptide (NOVX, x= 1-14) their variants or mature 

CC form. Also included are the nucleic acids encoding the NOVX proteins, a 

CC vector comprising the nucleic acid, a cell comprising the vector, an anti 

CC -NOVX antibody and modulators of NOVX. NOVX, the nucleic acid and the 

CC antibody are useful for treating or preventing a NOVX-associated 

CC disorder, where the disorder is selected from cardiomyopathy, 

CC atherosclerosis, diabetes, a disorder related to cell signal processing 

CC and metabolic pathway modulation, metabolic disorders, obesity, 

CC infectious disease, anorexia, cancer-associated cachexia, cancer, 

CC neurodegenerative disorders, Alzheimer 1 s disease, Parkinson's disease, 

CC immune disorders, haematopoietic disorders, and the various 

CC dyslipidaemias, metabolic disturbances associated with obesity, the 

CC metabolic syndrome X and wasting disorders associated with chronic 

CC diseases, bacterial, fungal, protozoal and viral infections, pain, 

CC bulimia, asthma, hypertension, urinary retention, osteoporosis, Crohn's 



CC disease, multiple sclerosis, Albright Hereditary Osteodystrophy, angina 

CC pectoris, myocardial infarction, ulcer, allergy, benign prostatic 

CC hypertrophy, and psychotic and neurological disorders, including anxiety, 

CC schizophrenia, manic depression, delirium, dementia, and dyskinesias, 

CC such as Huntington's disease and Gilles de la Tourette's syndrome. The 

CC nucleic acid is useful in gene therapy. The present sequence represents a 

CC NOVX protein 
XX 

SQ Sequence 898 AA; 



Query Match 100.0%; Score 4791; DB 5; Length 898; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 898; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 
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ID ADH71618 standard; protein; 898 AA. 
XX 

AC ADH71618; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21e SEQ ID NO:514. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens. 
XX 
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2002US-0387933P. 
2002US-0387934P. 
2002US-0387960P. 
2002US-0388022P. 
2002US-0388096P. 
2002US-0389123P. 
2002US-0389118P. 
2002US-0389120P. 
2002US-0389144P. 
2002US-0389146P. 
2002US-0389729P. 
2002US-0389742P. 
2002US-0389884P. 
2002US-0390006P. 
2002US-0390209P. 
2002US-0390763P. 
2002US-0396706P. 
2002US-0401628P. 
2002US-0402156P. 
2002US-0402256P. 
2002US-0402389P. 
2002US-0402786P. 
2002US-0402816P. 
2002US-0402821P. 
2002US-0402832P. 
2002US-0403448P. 
2002US-0403459P. 
2002US-0403531P. 
2002US-0403532P. 
2002US-0403563P. 
2002US-0406317P. 
2002US-0403617P. 
2002US-0406182P. 
2002US-0406355P. 
2002US-0406240P. 
2002US-0410084P. 
2002US-0412528P. 
2002US-0412731P. 
2002US-0414801P. 
2002US-0414839P. 
2002US-0414840P. 
2002US-0414954P. 
2002US-0417186P. 
2002US-0417406P. 
2002US-0420639P. 
2002US-0421156P. 
2002US-0422690P. 



PR 01-NOV-2002; 2002US-0423130P . 

PR 05-NOV-2002; 2002US-00423798 . 

PR 05-NOV-2002; 2002US-0423798P . 

PR 12-NOV-2002; 2002US-0425453P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08, 

DR N-PSDB; ADH71617. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 514; 1880pp; English. 
XX 

CC The invention relates to a novel isolated^ polypeptide (NOVX). A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 898 AA; 

Query Match 100.0%; Score 4791; DB 8; Length 898; 
Best Local Similarity 100.0%; Pred. No. 0; 

Matches 898; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 

Qy 1 MAVRP GLW PAL LG I VLAAWLRG S GAQQ S ATVAN P VP GAN P DLL PH FLVE P E DVY I VKN K P 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 1 MAWPGLWPALLGIVXAAWLRGSGAQQSATVANPVYGANPDLLP 60 

Qy 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS SGLPTMEVRINVSRQQVEKVFGLE 120 

I I I M I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS SGLPTMEVRINVSRQQVEKVFGLE 120 



Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 



I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I t I I I II I I i I I I II I I I I I I I I I I ! I I I 

Db 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

Qy 181 VEWLRNEDLVDPSLDPNWITREHSLVW 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 VEWLRNEDLVDPSLDPNVYITREHSLWRQARL^ 240 

Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

Qy 301 WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 360 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 301 WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 360 

Qy 361 LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 420 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 361 LYVGLI AVAVCLVLLLLVLI LVYCRKKEGLDS DVADS S I LT SGFQPVS I KPSKADNPHLL 420 

Qy 421 TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 480 

I II I II II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 421 TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 480 

Qy 481 RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAIPRGKI YEI YLTLHK 540 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 RLSTQNYFRSLPRGTSNMT YGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHK 540 

Qy 541 PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 PEDVRLPLAGCQTLLS PI VSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEGSW 600 

Qy 601 EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 660 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 601 EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 660 

Qy 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

Qy 721 KSKLLVS YQEI PFYHIWNGTQRYLHCT FTLERVS PSTS DLACKLWVWQVEGDGQS FS INF 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 721 KSKLLVS YQEI PFYHIWNGTQRYLHCT FTLERVS P ST S DLACKLWVWQVEGDGQS FS INF 780 

Qy 781 NITKDTRFAELLALESEAGVPALVGPSAFKI PFLIRQKI I SSLDPPCRRGADWRTLAQKL 840 

I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I 
Db .781 NITKDTRFAELLALESEAGVPALVGPSAFKI PFLIRQKI I SSLDPPCRRGADWRTLAQKL 840 

Qy 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 HLDSHLSFFASKPSPTAMI LNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 3 
AAU97899 

ID AAU97899 standard; protein; 898 AA. 
XX 

AC AAU97899; 



XX 

DT 27-AUG-2002 (first entry) 
XX 

DE Human netrin binding membrane receptor UNC5H-1 protein. 
XX 

KW Netrin binding membrane receptor; receptor; UNC5H-1; human; nootropic; 

KW neuroprotective; cytostatic; antiparkinsonian; cerebroprotective; cancer; 

KW central nervous system; CNS; stroke; Parkinson's disease; 

KW multiple sclerosis; Alzheimer's disease. 
XX 

OS Homo sapiens . 
XX 

FH Key Location/Qualifiers 

FT Domain 152. .223 

FT /note= "Immunoglobulin domain " 

FT Domain 247. .294 

FT /note= "Thrombospondine type 1 domain " 

FT Domain 302. .348 

FT /note= "Thrombospondine type 1 domain" 

FT Region 361. .382 

FT /note= "Transmembrane region" 

FT Domain 495. .598 

FT /note= "ZU5 domain" 

FT Domain 817. .897 

FT /note= "Death domain" 

XX 

PN WO200233080-A2. 
XX 

PD 25-APR-2002. 
XX 

PF 15-OCT-2001; 2001WO-EP011891 . 
XX 

PR 16-OCT-2000; 2000US-0240061P . 
XX 

PA (FARB ) BAYER AG. 
XX 

PI Koehler RH; 
XX 

DR WPI; 2002-463314/49. 

DR N-PSDB; ABK52891. 
XX 

PT Novel human netrin binding membrane receptor polypeptide and 

PT polynucleotides for identifying modulating agents useful in treating 

PT diseases e.g. Parkinson's disease, multiple sclerosis, stroke, 

PT Alzheimer's disease. 

XX _ 

PS Claim 1; Fig 2; 94pp; English. 

XX 

CC This invention relates to the DNA and protein sequences of a novel 

CC purified human netrin binding membrane receptor, UNC5H-1. The DNA 

CC sequence of the invention is useful as a probe for detecting a nucleic 

CC acid encoding the UNC5H-1 protein in a biological sample. The sequences 

CC of the invention are useful to screen for agents which decrease the 

CC activity of the UNC5H-1 protein. The sequences are also useful for 

CC screening agents which regulate (modulate) the activity of the protein of 

CC the invention. A pharmaceutical composition containing the protein of the 

CC invention or a reagent that modulates the activity of the UNC5H-1 protein 



CC may be useful for treating a UNC5H-1 dysfunction related disease such as 

CC cancer or a central nervous system (CNS) disorders (e.g, Parkinson's 

CC disease, multiple sclerosis, stroke and Alzheimer's disease). Fusion 

CC proteins comprising the UNC5H-1 protein are useful for generating 

CC antibodies and for in various assay systems, and the protein can be used 

CC as a bait protein in a two-hybrid assay or three-hybrid assay. The method 

CC of the invention is useful for detecting a coding sequence for the UNC5H- 

CC 1 protein. The present sequence represents the human netrin binding 

CC membrane receptor UNC5H-1 protein of the invention 

XX 

SQ Sequence 898 AA; 



Query Match 99.8%; Score 4781; DB 5; Length 898; 

Best Local Similarity 99.8%; Pred. No. 0; 

Matches 896; Conservative 1; Mismatches 1; Indels 0; Gaps 0; 



Qy 


i 


MAVT^PGLWPALLGIVTiAAWLRGSGAQQSATVANPVPGAN 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i 


MAVTUPGLWPALLGIVLAAWLRGSGAQQSAWANPWGANPDLLP 


60 


Qy 


61 


VXLVCKAVPATQIFFKCNGEWVl^QvTlHVIERSTDGSSGLPTMEVl^IWSRQQv^KVFGLE 


120 






II 1 ! 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 I 1 1 1 1 1 1 1 1 




Db 


61 


VXLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS S GLPTMEVRINVS RQQVEKVFGLE 


120 


Qy 


121 


EYWCQCVAWS S SGTTKSQKAYI RI ARLRKNFEQEPLAKEVS LEQGI VLPCRPPEGI PPAE 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 


Qy 


181 


VEWLRNEDLVDP S LDPNVYI T REHS LWRQARLADTAN YTCVAKNI VARRRSASAAVTVY 


240 






1 1 1 1 1 1 M M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 




Db 


181 


VEWLRN ED LVD P S L D PNVY I T REH S LWRQARLADTAN YT C VAKN I VARRRS AS AAVI VY 


240 


Qy 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II 1 1 II 1 1 II 1 1 1 1 1 1 II II 1 II 1 1 1 1 1 1 1 1 II 1 , 




Db 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 


Qy 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 




Db 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVA 


360 


Qy 


361 


LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LWGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 


Qy 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 l-l 1 I I I I 




Db 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 


Qy 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAIPRGKI YEI YLTLHK 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAIPRGKI YEI YLTLHK 


540 


Qy 


541 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 


Qy 


601 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


660 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


601 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


660 




661 




Ton 






1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 


vy 




VT.TA/'Q VOITT DT7VUTTATMr* , T , /^lD VT llfT* LiT'T rD^rODCTORT A r* VT T*T\ 7T»T/"M rc/~*T\r , /~\c! ue TXTTT» 

i\oi\ijijVo r r InJ-WIMiaiyKl linCl r 1 bLKVoro 1 oUiiAQ>J\LW VWy VEjoUCjQor SXNr 


/ oU 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I 




Db 


721 


KSKLLVSYQEIPFiTilWNGTQRYLHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINF 


780 


yy 


/ox 


NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 I | | 




Db 


781 


NITKDTRFAELLALESEIAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 


Qy 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 4 
AAU79939 

ID AAU79939 standard; protein; 899 AA. 
XX 

AC AAU79939; 
XX 

DT 15-JUL-2002 (first entry) 
XX 

DE Human UNC5-like protein N0V1. 
XX 

KW Human; NOVX polypeptide; cardiomyopathy; atherosclerosis; cancer; 

KW cell signal processing; metabolic pathway modulation; cancerous tissue; 

KW antibody; diabetes; transgenic animal; UNC5-like protein; N0V1; 

KW chromosome 13. 

XX 

OS Homo sapiens . 
XX 

PN WO200229038-A2. 
XX 

PD ll-APR-2002. 
XX 

PF 04-OCT-2001; 2001WO-US031377 . 
XX 

PR 04-OCT-2000; 2000US-0237862P . 
XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Herrmann JL, Rastelli L, Shimkets RA; 
XX 

DR WPI; 2002-340104/37. 

DR N-PSDB; ABK49422. 
XX 

PT Novel isolated NOVX polypeptide, and encoded polynucleotide, useful for 

PT treating cardiomyopathy, artherosclerosis, and cancer. 

XX 

PS Claim 1; Page 9; 180pp; English. 
XX 



CC The present invention relates to a new NOVX polypeptide having a 900 

CC (NOV1), 4349 (NOV2), 940 (NOV3) , 798 (NOV4), 865 (NOV5) , or 331 (NOV6) 

CC residue amino acid sequence, as given in the specification. The novel 

CC polypeptide, and its encoding polynucleotide, are used to treat 

CC cardiomyopathy, atherosclerosis, cancer or a disease related to cell 

CC signal processing and metabolic pathway modulation, in a human. Detecting 

CC the polypeptide or polynucleotide is useful for identifying cancerous 

CC tissue. The antibody can be used to treat diabetes or cancer. The host 

CC cells can be used to produce non-human transgenic animals useful in drug 

CC screening. The present amino acid sequence is that of the human UNC5-like 

CC protein NOV1 of the invention. This sequence is encoded by the human UNC5 

CC -like NOV1 gene located on chromosome 13 

XX 

SQ Sequence 899 AA; 



Query Match 98.1%; Score 4698.5; DB 5; Length 899; 

Best Local Similarity 98.7%; Pred. No. 0; 

Matches 888; Conservative 2; Mismatches 7; Indels 3; Gaps 3; 



Qy 


i 


MAWPGLWPALLGIVTiAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDWIVKNKP 


60 






1 1 II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i 


MAWPGLWPALLGIVTiAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDVTIVraKP 


60 


Qy 


61 


VXLVCKAVPATQI FFKCNGEWV^QvTDHVT ERSTDGS S GLPTMEVRINVS RQQVEKVFGLE 


120 






1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


VL LVC KAVPATQ I FFKCNGEWVRQVDHVI E RS T D GS S GE PTMEVRI NVS RQQVEKVFGLE 


120 


Qy 


121 


EYWCQCVAWS S SGTTKSQKAYI RI ARLRKNFEQEPLAKEVSLEQGI VLPCRPPEGI P PAE 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


EYWCQCVAWSS SGTTKSQKAYI RIARLRKNFEQEPLAKETVSLEQGIVLPCRPPEGIP PAE 


180 


Qy 


181 


VEWLRNED LVD P S L D PNVYI T REH S LWRQARLADTAN YTCVAKN I VARRRS ASAAVI VY 


240 






lltllllllllllllllllllllltilllftllllLlllllflllllllllftiftiiiiiii 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


181 


VTSWLRNEDLVTJPSLDPNWITREHSLV^ 


240 


Qy 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 


299 


Db 


241 


1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 : 1 : 1 III 
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RESULT 5 




ADG42569 




ID 


ADG42569 standard; protein; 899 AA. 




VY 
AA 






AC 


ADG42569 ; 




XX 






DT 


26-FEB-2004 (first entry) 




XX 






DE 


Novel human NOV1. 




XX 






KW 


cytostatic; gene therapy; NOVX-agonist ; 


NOVX-antagonist; pharmaceutical; 


KW 


NOVX-associated disorder; cancer; human. 




XX 






OS 


Homo sapiens . 




XX 






PN 


US2003204052-A1. 




XX 






PD 


30-OCT-2003. 




XX 






PF 


04-OCT-2001; 2001US-00970944 . 




XX 






PR 


04-OCT-2000; 2000US-0237862P . 




XX 






PA 


(HERR/) HERRMANN J L. 




PA 


(RAST/) RASTELLI L. 




PA 


(SHIM/) SHIMKETS R A. 




XX 






PI 


Herrmann JL, Rastelli L, Shimkets RA; 




XX 






DR 


WPI; 2003-900673/82. 




DR 


N-PSDB; ADG42568. 




XX 






PT 


New NOVX gene or NOVX-specif ic antibody, 


useful for preparing a 


PT 


composition for treating or preventing a 


NOVX-associated disorder, e.g., 



PT cancer. 
XX 

PS Claim 1; SEQ ID NO 2; 118pp; English. 
XX 

CC The invention describes a new isolated polypeptide comprising: a 

CC polypeptide or its mature form comprising a sequence not given in the 

CC specification; or a variant of (A) , where one or more amino acid residues 

CC in the variant differs in no more than 15% from the amino acid sequence 

CC of the mature form. The pharmaceutical composition may be administered 

CC via oral, transdermal, rectal or parenteral route. The polypeptide, 

CC nucleic acid or antibody is useful for preparing a composition for 

CC treating or preventing a NOVX-associated disorder, e.g., cancer. This is 

CC the amino acid sequence of a human NOVX protein. 

XX 

SQ Sequence 899 AA; 

Query Match 98.1%; Score 4698.5; DB 7; Length 899; 

Best Local Similarity 98.7%; Pred. No. 0; 

Matches 888; Conservative 2; Mismatches 7; Indels 3; Gaps 3; 

MAVT^PGLWPALLGIVTWVWLRGSGAQQSAWANPVPGA^ 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
MAWPGLWPALLGIVXAAWLRGSGAQQSAWANPVPGANPDLLPHFLVT2PEDW 60 

VLLVCKAVPATQI FFKCNGEWVRQVDHVT ERSTDGS SGLPTMEVRINVS RQQVEKVFGLE 120 

I I I I I I I I I I II I II I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

VTjLVCKAVPATQIFFKCNGEWWQVTDHVT 120 

EYWCQCVAWS S SGTTKSQKAYI RIARLRKNFEQEPLAKEVS LEQGI VLPCRP PEGI PPAE 180 
I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
EYWCQCVAWS S SGTTKSQKAYI RIARLRKNFEQEPLAKEVS LEQGI VLPCRP PEGI P PAE 180 

VTSWLRNEDLVDPSLDPNV^ITREHSLWRQARLADT 240 

I I I I I II I I I I I I II I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I II I I I I 
vTSWLRNEDLVT)PSLDPNWITREHSLWRQARLAD 240 

WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 299 
I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I : I : I I II 
VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDG 300 

SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 359 

I I I I I I I I I I I I I I I II I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 360 

AL YVGL I AVAVCLVL L LLVL I LVYC RKKEGL D S DVADS S I LT S G FQ P VS I KP S KADN PHL 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I hi I I 
AL YVGL I AVAVCLVL L LLVL I LVYC RKKEGL D S DVADS S I LT S G FQ P VS I K P S KADN PHL 420 

LTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 479 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I II I I I I I I I I I I I I I I I I I M I I I I I I 
LTIQPDLS-TTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 479 

SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLH 539 
I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLH 539 



Qy 


1 


Db 


1 


Qy 


61 


Db 


61 


Qy 


121 


Db 


121 


Qy 


181 


Db 


181 


Qy 


241 


Db 


241 


Qy 


300 


Db 


301 


Qy 


360 


Db 


361 


Qy 


420 


Db 


421 


Qy 


480 


Db 


480 



Qy 



540 KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGS 



599 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


540 


KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGS 


599 


vy 


DUU 


WE-DVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 


658 






II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 




Db 


600 


WEQDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 659 






CT5LEYNIRWCLHDTHDALKEWQLEKQLGG 


718 






1 1 1 II 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I | I I I I I t I I I I I I 




Db 


660 


CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 


719 




71 Q 
/ J. 


LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 


778 






1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


720 


LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 


779 


uy 


/ / y 


NFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 


838 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


780 


NFNITKDTRFAELIALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 


839 


Qy 


839 


KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 


898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


840 


KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 


899 



RESULT 6 
ADH71610 

ID ADH71610 standard; protein; 899 AA. 
XX 

AC ADH71610; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21a SEQ ID NO: 506. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 
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PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71609. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 506; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 
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ID ADH71628 standard; protein; 899 AA. 
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DE Human protein of the invention N0V21j SEQ ID NO: 524. 
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KW obesity; diabetes; infectious disease; metabolic syndrome X; 
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PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders , e.g. cancer, diabetes , infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 524; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 
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Query Match 98.1%; Score 4698.5; DB 8; Length 899; 

Best Local Similarity 98.7%; Pred. No. 0; 

Matches 888; Conservative 2; Mismatches 7; Indels 3; Gaps 3; 
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PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71635. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmaco genomics . 
XX 

PS Example 21; SEQ ID NO 532; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulatory 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer* s disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics . The present sequence 



CC represents a NOVX polypeptide of the invention. 
XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4697.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 3; Mismatches 7; Indels 3; Gaps 3; 
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Qy 779 NFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 838 

I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I | M I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 780 NFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 839 

Qy 839 KLHLDSHLSFFASKPSPTAMILNLWElARHFPNGNLSQIiAAAVAGLGQPDAGLFTVSEAEC 898 

I M I I I I.I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 840 KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899 
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PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 538; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 
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Query Match 98.0%; Score 4695.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 3; Mismatches 7; Indels 3; Gaps 3 
Qy 1 MAVTIPGLWPALLGIVIjAAWLRGSGAQQSATVANPWGANPDLL 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I ! I I I I I I I I I I I I I I I I I I 

Db 1 MAVT*PGLWPALLGIVXAAWLRGSGAQQSAWANPVPGANPDLL 60 

Qy 61 VLLVCKAVTATQIFFKCNGEWVT^Q^ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I 
Db 61 VXLVCKAVPATQIFFKCNGEWVT^QvTDH^ 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 
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PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Gasman SJ; 

PI Catterton E, Chapoval A f Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L f Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W f Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ f Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 
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DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71639. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX- associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 536; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulatory 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 



CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 



CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics . The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4695.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 3; Mismatches 7; Indels 3; Gaps 3; 
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VXLVCKAVPATQI FFKCNGEWRQ VT)HVI ER^^ 
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VXLVCKAVPATQIFFKCNGEWRQVT)HVTERSTDGSSGEPTMEvl^INVSRQQV^KVFGLE 
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EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 
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VEWLRNEDLVD P S LD PNVYI TREH S L VVRQARLADTAN YTCVAKN I VARRRS AS AAVT VY 
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VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 
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LTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 
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LTIQPDLS-TTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 
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SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLH 
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SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLH 
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KPEDVRLPLAGCQTLLS PIVSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEGS 
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KPEDVRLPIiAGCQTLLSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEGS 599 



Qy 600 WE-DVXHLGEEAPSHLYYCQLEASACYVTTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 658 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I : I I I I I I I I 

Db 600 WEQDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLRLLLFAPVA 659 

Qy 659 CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 718 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I 

Db 660 CT S LE YNI RVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDS YHNLRLS I HDVPS S 719 

Qy 719 LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 778 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 720 LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 779 

Qy 779 NFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 838 

II I I I I I I I I I II I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I II I I I I 1 I I I I I I I I I I I I 

Db 780 NFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 839 

Qy 839 KLHLDSHLS FFASKPS PTAMI LNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVS EAEC 898 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 840 KLHLDSHLS FFASKPS PTAMI LNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVS EAEC 899 

RESULT 11 
ADH71632 

ID ADH71632 standard; protein; 899 AA. 
XX 

AC ADH71632; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV211 SEQ ID NO: 528. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens . 
XX 

PN WO2003102155-A2 . 
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PD ll-DEC-2003. 
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PF 03-JUN-2003; 2003WO-US0.17430 . 
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PR 03-JUN-2002; 2002US-0385120P . 
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PR 06-JUN-2002; 2002US-0387016P . 

PR 07-JUN-2002; 2002US-0386796P . 

PR 07-JUN-2002; 2002US-0386816P . 
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PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I f Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71631. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 528; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective , nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease f Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 98.0%; Score 4694.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 
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EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 
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VEWLRNEDLVD P S LD PNVYI T REHS LWRQARLADTAN YTCVAKN I VARRRS AS AAVT VY 
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A /"NT C CTaJ C TTaTT 1 TTTa7 O A TC C 7\ CrrD HiTflVD CDC /TPVT DADT XT /"•/"• 7\ ZTT^T? r*/~\\T\ T r\ vrn 7V r* 7\ W T r* T"»1 rrsr* 
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VNGGWS TWTEWS VCS AS CGRGWQKRS RS CTN PAPLNGGAFCEGQNVHDRTVS S LLVS VDG 
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SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 
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RESULT 12 
ADH71638 

ID ADH71638 standard; protein; 899 AA. 
XX 

AC ADH71638; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention N0V21o SEQ ID NO:534. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens. 
XX 

PN WO2003102155-A2 . 
XX 
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XX 
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DR N-PSDB; ADH71637. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmaco genomics . 
XX 

PS Example 21; SEQ ID NO 534; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pha rmacogenomi cs . The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 

Query Match 98.0%; Score 4694.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 

Qy 1 MAVTCPGLWPALLGIVLAAWLRGSGAQQSATVANPVPGANPD 60 

I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I! I I I I I 
Db 1 MAVT^PGLWPALLGIVTWVWLRGSGAQQSATVANPVPGANPDLLPHF 60 

Qy 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGSSGLPTMEVRINVSRQQVEKVFGLE 120 

I I I I I I M I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 VLLVCKAWATQIFFKCNGEWVKQVTDHVIERSTDGSSGEPT^ 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 EYWCQCVAWS S SGTTKSQKAYI RI ARLRKNFEQEPLAKEVS LEQGI VLPCRPPEGI PPAE 180 

Qy 181 VEWLRNEDLVD P S LD PNVYI T REH S LVVRQARLADTAN YTCVAKNI VARRRS ASAAVI VY 240 

I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 VTSWLRNEDLVT)PSLDPNWITREHSLWRQARI^^ 240 

Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 299 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I III 
Db 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDG 300 

Qy 300 SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 359 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 360 

Qy 360 AL YVGL I AVAVCLVL L L LVL I LVYC RKKEGLD S DVAD S S I LT S G FQPVS I KP S KADN P HL 419 

I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 361 ALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHL 420 
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NFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 
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KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQIJWVVAGLGQPDAGLFWSE^C 
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KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 
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RESULT 13 
ADH71634 

ID ADH71634 standard; protein; 899 AA. 
XX 

AC ADH71634; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention N0V21m SEQ ID NO: 530. 
XX 



KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer's disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia . 

XX 

OS Homo sapiens . 
XX 

PN WO2003102155-A2. 
XX 

PD ll-DEC-2003. 
XX 

PF 03-JUN-2003; 2003WO-US017430 . 
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XX 

PA (CURA-) CURAGEN CORP. 
XX 

PI Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 

PI Catterton E, Chapoval A, Crabtree-Bokor JR, Edinger SR, Ellerman K; 

PI Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 

PI Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 

PI Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 

PI Padigaru M, Patturajan M, Pena CEA, Peyman JA, Raha D, Rastelli L; 

PI Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 

PI Smithson G, Spytek KA f Stone DJ, Vernet CAM, Voss EZ, Zhong M; 

PI Zhong H; 

XX 

DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71633. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 530; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 



XX 

SQ Sequence 899 AA; 



Query Match 98.0%; Score 4693.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 
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MAWPGLWPALLGI VliAAWLRGSGAQQSAWANPVPG 


60 






i i i i i i i \ i i i i i i i i i i t i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i i i i i i i i i i 
M II II II 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
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MAVRP GLWPALLGI VLAAWLRG S GAQQ S AT VAN PVP GAN P D LLPH FL VE P EDVY I VKNKP 
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Qy 
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VLLVCKAVPATQI FFKCNGEWVRQVDHVT ERSTDGS SGLPTMEVRINVS RQQVEKVFGLE 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I t I I I I I I i I I i i i r i i i i i i i i i i i i i i i i i i i i 
1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 I 




Db 


61 


VXLVCKAVPATQI FFKCNGEWRQVDHVI ERSTDGS SGEPTMEVRINVS RQQVEKVFGLE 
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Qy 
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EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 




Db 
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EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 
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VEWLRN E DLVD P S LD PNVYI T REH S LWRQARLADTAN YT CVAKN I VARRRS AS AAVT VY 
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1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I 




Db 
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VEWLRN EDLVD P S L D PNVYI T REH S LWRQARLADTAN YT CVAKN I VARRRS AS AAVIVY 
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Qy 
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WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 : 1 : 1 III 




Db 
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VNGGWSTWT EWS VCSAS CGRGWQKRS RS CTN PAP LNGGAFCEGQNVHDRTVS S LLVS VDG 


300 


Qy 
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SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 
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II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 
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SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 
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Qy 
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AL YVGL I AVAVC LVLL LLVL I LVYC RKKE GL D S DVAD S S I LT S G FQ P VS I KP S KADN PH L 
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1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 
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ALWGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHL 


420 


Qy 


420 


LTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 


479 






llllllll I t I I i i \ i i t i i i i i i i i i i i j i i i i i i i i i i \ t t i ■ i i i i i i i i i ■ i i i i 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 
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LTIQPDLS-TTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 
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S RLSTQN YFRS LPRGTSNMT YGT FNFLGGRLMI PNTGI S LLI P PDAI PRGKI YEI YLTLH 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 
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SRLSTQNYFRS LPRGTSNMT YGT FNFLGGRLMI PNTGI S LLI PPDAI PRGKI YEI YLTLH 
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KPEDVI^LPLAGCQTLLSPIVSCGPPGvXLTRPVlIIJ^MDHCGEPSPDSWSLRLKKQSCEGS 
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i i i i i i i i i i i i i i i i i i i i t i i i i i i i i i i i i i i i t i i i i t i i i i i i i i i i i i i i t i i 
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KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLHLKKQSCEGS 
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WE-DVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 
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WEQDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 
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CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 


718 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 
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CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 
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LWKS KLLVS YQEI PFYH IWNGTQRYLHCT FTLERVS P ST SDLACKLWVWQVEGDGQS FS I 
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1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 720 LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 779 



Qy 779 N FN I T KDT RFAELLAL E S EAGVP ALVG P S AFK IPFLIRQKIISSLDPPC RRGADW RT LAQ 838 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 780 N FN I T KDT RFAE LLAL E S EAGVP ALVGP S AFKI PFLIRQKIISSLDPPC RRGADWRTLAQ 839 

Qy 839 KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

II I I I I II I I I I I I I I I I I II I II I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I i I 

Db 840 KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899 



RESULT 14 
ADH71648 

ID ADH71648 standard; protein; 899 AA. 
XX 

AC ADH71648; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21t SEQ ID NO: 544. 
XX 

KW human; cytostatic; immunomodulator; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 

KW vaccine; cancer; cachexia; Alzheimer f s disease; Parkinson's disease; 

KW obesity; diabetes; infectious disease; metabolic syndrome X; 

KW dyslipidaemia. 

XX 

OS Homo sapiens . 
XX 

PN WO2003102155-A2. 
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DR WPI; 2004-081935/08. 

DR N-PSDB; ADH71647. 
XX 

PT New NOVX polypeptides and nucleic acid molecules useful for preventing or 

PT treating NOVX-associated disorders, e.g. cancer, diabetes, infection or 

PT obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 
XX 

PS Example 21; SEQ ID NO 544; 1880pp; English. 
XX 

CC The invention relates to a novel isolated polypeptide (NOVX) . A 

CC polypeptide of the invention has cytostatic, immunomodulator, 

CC neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 

CC antilipaemic activity, and may have a use in gene therapy, and as a 

CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 



CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics. The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 97.9%; Score 4691.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 



Qy 1 MAVl^PGLWPALLGIVTiAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDVTIVTQ^K^ 60 

I I I I I 1 I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1 MAVT*PGLWPALLGIVLAAWLRGSGAQQSAWANPVPGANPDLLPHFLV^PEDW 60 

Qy 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS SGLPTMEVRINVS RQQVEKVFGLE 120 ... 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I II I I I 
Db 61 VXLVCKAVPATQIFFKCNGEWV^QVTDHVIERSTDGSSGEPTMEVRINVSRQQVEKVFGLE 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I 
Db 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

Qy 181 vT^WLRNEDLVT)PSLDPNVYITREHSLvVRQARLADTANYTC 240 

I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I 
Db 181 VEWLRNEDLVD P S LD PNVYI T REH S LVVRQARLADTAN YTCVAKN I VARRRS AS AAVT VY 240 



Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 299 

I I I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I : I III 
Db 241 WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVHDRTVSSLLVSVDG 300 

Qy 300 SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 359 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 301 SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 360 

Qy 360 ALYVGLIAVAVCLVLLLLVLI LVYCRKKEGLDSDVADSSI LTSGFQPVS I KPSKADNPHL 419 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I 
Db 361 ALWGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADS SI LTSGFQPVS I KPSKADNPHL 420 

Qy 420 LTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 479 

I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 421 LTIQPDLS-TTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 479 

Qy 480 SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLH 539 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 480 SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLH 539 

Qy 540 KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGS 599 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 

Db 540 KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGS 599 

Qy 600 WE-DVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 658 

II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 600 WEQDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVA 659 

Qy 659 CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 718 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 660 CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 719 

Qy 719 LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 778 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 720 LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGGGQSFSI 779 

Qy 779 N FN I T KDTRFAELLAL E S EAGVPALVG P S AFK IPFLIRQKIISSLDP P CRRGADWRT LAQ 838 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 780 N FN I T KDTRFAELLAL E S EAGVPALVG P S AFKI P FL I RQK 1 1 S S LD P P CRRGADWRT LAQ 839 

Qy 839 KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I I I I I I I I I I I I I I I I I II I I I I I II I I II I I I I I I I II I I I I I I I II I I I I I I I I I I 
Db 840 KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 899 



RESULT 15 
ADH71630 

ID ADH71630 standard; protein; 899 AA. 
XX 

AC ADH71630; 
XX 

DT 25-MAR-2004 (first entry) 
XX 

DE Human protein of the invention NOV21k SEQ ID NO: 526. 
XX 

KW human; cytostatic; immunomodulator ; neuroprotective; nootropic; 

KW anorectic; antidiabetic; antimicrobial; antilipaemic; gene therapy; 
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U414839P . 


q a 
oU- 


o TT* n 


onno 
Z UUZ , 


> O A A OTTO 

zuuzus- 


U4 14 840P . 


o n 


-OEiF- 


o a a o . 
Z UUZ ; 


O A A OTTO 

ZUUzUS- 


A » 1 X AC il n 

0414954P . 


a q 

uy- 


_ Ut 1 - 


O A AO 

ZUUZ , 


O A A Otto 

ZUUzUS- 


0417186P . 


a q 

uy- 


-OCT- 


o a a o « 

zUuz ; 


zOOzUS- 


0417406P . 


O O 

Zo- 


/-< rp 


zUUz ; 


OAAOTTO 

zUOzUS- 


A/t OA/TOAn 

0420639P . 


O Q 

Z o- 


-uui - 


O A A O t 

zUUz ; 


O A A OT t o 

ZUUZUS- 


A/101 i r/Tn 

U421156P . 


31- 


-OCT- 


2002; 


2002US- 


A >l 00^"O/"\tn 

0422690P. 


01- 


-NOV- 


2002; 


2002US- 


0423130P. 


05- 


-NOV- 


2002; 


2002US- 


00423798. 


05- 


-NOV- 


2002; 


2002US- 


0423798P. 


12- 


-NOV- 


2002; 


2002US- 


0425453P. 



(CURA-) CURAGEN CORP. 

Alsobrook JP, Alvarez E, Anderson DW, Boldog FL, Casman SJ; 
Catterton E, Chapoval A, Crabtree-Bokor JR,. Edinger SR, Ellerman K; 
Ettenberg S, Gangolli EA, Gerlach VL, Gorman L, Gunther E, Guo X; 
Gusev VY, Herrmann JL, Ji W, Kekuda R, Li L, Liu X, Macdougall JR; 
Maclachlan T, Malyankar UM, Mezick AJ, Millet I, Mishra VS; 
Padigaru M, Patturajan M f Pena CEA, Peyman JA, Raha Rastelli L; 

Rieger DK, Rothenberg ME, Sciore P, Shenoy SG, Shimkets RA; 
Smithson G, Spytek KA, Stone DJ, Vernet CAM, Voss EZ, Zhong M; 
Zhong H; 

WPI; 2004-081935/08. 
N-.PSDB; ADH71629. 

New NOVX polypeptides and nucleic acid molecules useful for preventing or 
treating NOVX- associated disorders, e.g. cancer, diabetes, infection or 
obesity, and in chromosome mapping, tissue typing or pharmacogenomics . 

Example 21; SEQ ID NO 526; 1880pp; English. 

The invention relates to a novel isolated polypeptide (NOVX) . A 
polypeptide of the invention has cytostatic, immunomodulator, 
neuroprotective, nootropic, anorectic, antidiabetic, antimicrobial, and 
antilipaemic activity, and may have a use in gene therapy, and as a 



CC vaccine. The polypeptides are encoded by NOVX polynucleotides comprising 

CC any of the 303 fully defined nucleotide sequences given in the 

CC specification. The polypeptide is useful in the manufacture of a 

CC medicament for treating a syndrome associated with a human disease. The 

CC polypeptide, polynucleotide and antibody are useful in diagnosing, 

CC treating or preventing NOVX-associated disorders, e.g. cancer, cachexia, 

CC Alzheimer's disease, Parkinson's disease, obesity, diabetes, infectious 

CC diseases, metabolic syndrome X or dyslipidaemias . The nucleic acids are 

CC further used as hybridisation probes, in chromosome mapping, tissue 

CC typing, preventive medicine, and pharmacogenomics . The present sequence 

CC represents a NOVX polypeptide of the invention. 

XX 

SQ Sequence 899 AA; 



Query Match 97.9%; Score 4691.5; DB 8; Length 899; 

Best Local Similarity 98.6%; Pred. No. 0; 

Matches 887; Conservative 2; Mismatches 8; Indels 3; Gaps 3; 



Qy 


1 


MAWPGLWPALLGIVXAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^ 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


1 


MAVRP GLWPALLG I VTiAAWLRG S GAQQS AT VAN P VP GAN P DL LP H FLVEP EDVYI VKNK P 


60 


Qy 


61 


VXLVCKAVPATQIFFKCNGEWVT*QVT)HVI 


120 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


VTjLVCKAVPATQIFFKCNGEWVI^QVIDHVIERSTDGSSGEPTMEWINVSRQQV^KWGL^ 


120 


Qy 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 


Qy 


181 


VEWLRNEDLVT)PSLDPNVTITREHSLWRQARLADTAN 


240 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


V^WLRNEDLVT>PSLDPNWITREHSLWRQARLADTAN^ 


240 


Qy 


241 


WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNV-QKTACATLCPVDG 


299 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : I II 1 




Db 


241 


VNGGWS TWT EWS VC SAS CGRGWQKRS RS CTN PAPLNGGAFCEGQNVHDRTVS S LLVS VDG 


300 


Qy 


300 


SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 


359 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 


360 


Qy 


360 


ALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSI LTSGFQPVSIKPSKADNPHL 


419 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


ALYVGLIAVAVCLVIjLLLVP I LVYCRKKEGLDSDVADS SI LTSGFQPVSIKPSKADNPHL 


420 


Qy 


420 


LTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 


479 






M 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I || I I I I I I I || | | I I I I 




Db 


421 


LTIQPDLS-TTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFV 


479 


Qy 


480 


SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLH 


539 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


480 


SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLH 


539 


Qy 


540 


KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGS 


599 






1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


540 


KPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGS 


599 









boo 






ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


600 


WEQDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSYAAAKRLKLLLFAPVA 


659 






CT* Q T i?VMTD'\/vr , T T-rrvPuriAT 1^1?\A/AT CW™>T rr*OT TAPOD\rr upvnevmTT dt oTumroco 
oXjCjIXn J.r\v I ^JjIIDI rlL/AL»i\£j v vyjjCjj\yij^iayijXyji,t'K V Jjtlc J\JJo xrlNliKijij inJJ V xr ob 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


660 


CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 


719 


wy 


/ J. :? 


T T*7 VQ l^T T WC V/""MTT TD TTVU T TVTKT rTHDVT VC^P I?TT COX 7"C? D C" Ti C T"\T A (~* V~T TVTt JTitfW. re/"* T\/T\ C* tt*c* T 

JjWJ\oJ\1j1jVo iyti± rJ? inlWJN^iyKxljriUi r 1 LtiKVofSl 5D1iAUKLWVWQVe,GDGQSe SI 


1 lo 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


720 


LWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSI 


779 


Qy 


/ / y 


N £ INI Kr AtljijiaAijbol^CjVrALVCjrbAr Kl Pc LIRQKIISSLDPPCRRGADWRTIiAQ 


838 






I I I t I I I I I I I I I I I I I I I i I i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i 

1 M 1 1 M M II M 1 II II 1 1 II 1! M 1 1 II II II 1 II II II M II II 1 II 1 M 1 M 1 I I I 




Db 


780 


NTOITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQ 


839 


Qy 


839 


KLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQl^AAVAGLGQPDAGLFTVSEAEC 


898 






1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 I 1 1 1 1 1 1 1 




Db 


840 


KLHLDSHLS FFASKPS PTAMI LN LWEARH F PNGN L SQ LAAAVAGL GQ P DAGL FT VS EAEC 


899 



Search completed: March l f 2005, 08:56:42 
Job time : 210.443 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 
Run on: March 1, 2005, 08:42:47 



Search time 53.8066 Seconds 

(without alignments ) 

1245.848 Million cell updates/sec 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



US-10-624-932-2 
4791 

1 MAVRPGLWPALLGI VLAAWL . 



.AVAGLGQPDAGLFTVSEAEC 898 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 
513545 seqs, 74649064 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



513545 



Post-processing: 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 

Database : Issued_Patents_AA: * 

1: /cgn2_6/ptodata/l/iaa/5A_COMB.pep: * 

2 : /cgn2_6/ptodata/l/iaa/5B_COMB.pep: * 

3 : /cgn2__6/ptodata/ 1/iaa/ 6A_COMB . pep : * 

4 : / cgn2_6/ptodata/ 1/iaa/ 6B_COMB. pep: * 

5 : /cgn2_6/ptodata/l/iaa/PCTUS_COMB . pep : * 

6: /cgn2_6/ptodata/l/iaa/backfilesl.pep: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 








Description 


1 


4638 


96. 8 


898 


2 


US- 


08- 


808- 


982-5 


Sequence 


5, Appli 


2 


4638 


96.8 


898 


3 


US- 


09- 


306- 


902A-5 


Sequence 


5, Appli 


3 


2815.5 


58.8 


557 


2 


US- 


08- 


808- 


982-6 


Sequence 


6, Appli 


4 


2815.5 


58.8 


557 


3 


US- 


09- 


306- 


•902A-6 


Sequence 


6, Appli 


5 


2571.5 


53.7 


943 


2 


us- 


08- 


808- 


•982-7 


Sequence 


7, Appli 


6 


2571.5 


53.7 


943 


3 


us- 


09- 


306- 


-902A-7 


Sequence 


7, Appli 


7 


2337.5 


48. 8 


769 


4 


us- 


09- 


949- 


•016-10665 


Sequence 


10665, A 


8 


1895 


39.6 


886 


4 


us- 


•09- 


969- 


•532-16 


Sequence 


16, Appl 


9 


1886.5 


39.4 


897 


4 


us- 


•09- 


•969- 


•532-14 


Sequence 


14, Appl 


10 


1878 


39.2 


900 


4 


us~ 


•09- 


•969- 


•532-12 


Sequence 


12, Appl 


11 


1869.5 


39.0 


911 


4 


us- 


-09- 


•969- 


•532-10 


Sequence 


10, Appl 



12 


1259. 5 


26. 


3 


655 


4 


us- 


09- 


969- 


532-32 


Sequence 


32 , Appl 


13 


1251 


26. 


1 


666 


4 


us- 


09- 


-969- 


532-30 


Sequence 


30 , Appl 


14 


1242.5 


25. 


9 


669 


4 


us- 


09- 


-969- 


532-28 


Sequence 


28 , Appl 


15 


1234 


25. 


8 


680 


4 


us- 


09- 


-969- 


-532-26 


Sequence 


26, Appl 


16 


1124 


23. 


5 


552 


4 


us- 


•09- 


•969- 


532-8 


Sequence 


8, Appli 


17 


1115.5 


23. 


3 


563 


4 


us- 


09- 


-969- 


-532-6 


Sequence 


6, Appli 


18 


1107 


23. 


1 


566 


4 


us- 


09- 


-969- 


•532-4 


Sequence 


4, Appli 


19 


1098 . 5 


22 . 


9 


577 


4 


us- 


09- 


•969- 


-532-2 


Sequence 


2, Appli 


20 


488 . 5 


10. 


2 


321 


4 


us- 


09- 


•969- 


-532-24 


Sequence 


24 , Appl 


21 


480 


10. 


0 


332 


4 


us- 


-09- 


-969- 


•532-22 


Sequence 


22, Appl 


22 


471.5 


9. 


8 


335 


4 


us- 


-09- 


-969- 


532-20 


Sequence 


20, Appl 


23 


463 


9 . 


7 


346 


4 


us- 


09- 


-969- 


-532-18 


Sequence 


18, Appl 


24 


296.5 


6. 


2 


1172 


1 


us- 


•08- 


-313- 


-288B-19 


Sequence 


19, Appl 


25 


296.5 


6. 


2 


1172 


4 


us- 


•09- 


-949- 


•016-6333 


Sequence 


6333, Pip 


26 


294 


6. 


1 


102 


2 


us- 


•08- 


-808- 


-982-8 


Sequence 


8, Appli 


27 


294 


6. 


1 


102 


3 


us- 


•09- 


-306- 


902A-8 


Sequence 


8, Appli 


28 


276 


5. 


8 


954 


4 


us- 


•09- 


-854- 


-845-14 


Sequence 


14, Appl 


29 


276 


5. 


8 


1049 


4 


us- 


■09- 


■854- 


-845-2 


Sequence 


2, Appli 


30 


276 


5. 


8 


1093 


4 


us- 


■09- 


-854- 


-845-4 


Sequence 


4, Appli 


31 


276 


5. 


8 


1151 


4 


us- 


-09- 


•854- 


•845-10 


Sequence 


10, Appl 


32 


272.5 


5. 


7 


939 


4 


us- 


•09- 


•854- 


-845-16 


Sequence 


16, Appl 


33 


272. 5 


5. 


7 


1034 


4 


us- 


•09- 


-854- 


■845-6 


Sequence 


6, Appli 


34 


272.5 


5. 


7 


1078 


4 


us- 


-09- 


-854- 


-845-8 


Sequence 


8, Appli 


35 


272.5 


5. 


7 


1136 


4 


us- 


■09- 


-854- 


•845-12 


Sequence 


12, Appl 


36 


272.5 


5. 


7 


1170 


4 


us- 


•09- 


•657- 


-472-2 


Sequence 


2, Appli 


37 


268 . 5 


5. 


6 


239 


5 


PCT-US93-01652-1 


Sequence 


1, Appli 


38 


268.5 


5. 


6 


1170 


1 


us- 


-08- 


-313- 


•288B-20 


Sequence 


20, Appl 


39 


249.5 


5. 


2 


441 


3 


us- 


•08- 


•985- 


-526-3 


Sequence 


3, Appli 


40 


249 


5. 


2 


479 


4 


us- 


•09- 


-270- 


-767-46823 


Sequence 


46823, A 


41 


243 


5 . 


1 


469 


1 


us- 


■08- 


-313- 


-288B-15 


Sequence 


15, Appl 


42 


243 


5. 


1 


484 


4 


us- 


-09- 


-949- 


-016-9698 


Sequence 


9698, Ap 


43 


242.5 


5. 


1 


1045 


4 


us- 


-09- 


-949- 


-016-11112 


Sequence 


11112, A 


44 


238 


5. 


0 


218 


3 


us- 


■08- 


-985- 


-526-1 


Sequence 


1, Appli 


45 


234.5 


4. 


9 


1395 


3 


us- 


-09- 


-540- 


-245A-15 


Sequence 


15, Appl 



ALIGNMENTS 



RESULT 1 
US-08-808-982-5 

Sequence 5, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 



ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 
CLASSIFICATION: 530 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 898 amino acids 
TYPE: amino acid 
STRANDEDNESS: not relevant 
TOPOLOGY: not relevant 
MOLECULE TYPE: peptide 
US-08-808-982-5 

Query Match 96.8%; Score 4638; DB 2; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 

Qy 1 MAVT^PGLWPALLGIVIiAAWLRGSGAQQSATVANPVPGANPDLLPHFL 60 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAVI^PGLWPVXLGIVliAAWLRGSGAQQSATVANPVPGANPDL 60 

Qy 61 VLLVCKAVPATQIFFKCNGEWVRQVTDHVIERSTDGSSGLP 120 

I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 VLLVCKAVPATQIFFKCNGEWVT*QvT>HV^ 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 
Db 121 EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

Qy 181 VEWLRNEDLVT)PSLDPNvYITREHSLVVRQARLADTANY 240 

I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 VEWLRNEDLVT)PSLDPNWITREHSLVVRQARIADTANYTCVAKNIVARRR 240 

Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

Qy 301 WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 360 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I : I I I I I I I 

Db 301 WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 360 



Qy 

Db 



361 
361 



LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 
I I : I I : I II I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
LYIGLVAVAVCLFLLLLALGLIYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 



420 
420 



Qy 421 TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I II I I I I I II I I I I I I I I I : I I I 

Db 421 TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 480 

Qy 481 RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLIPPDAI PRGKI YEI YLTLHK 540 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I 

Db 481 RLSTQNYFRSLPRGTSNMAYGTFNFLGGRLMI PNTGI SLLIPPDAI PRGKI YEI YLTLHK 540 

Qy 541 PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I.I I I I I I I II 

Db 541 PEDVRLPLAGCQTLLSPWSCGPPGVLLTRPVIIJ^HCGEPSPDSWSLRLKKQSCEGSW 600 

Qy 601 EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 660 

I I I I I I I I I : I I I I I II I I I I I I I I II II I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 

Db 601 EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 660 

Qy 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 661 S LE YN I RVYCLHDTHDALKEWQLEKQLGGQLI QEPRVLHFKDS YHNLRLS I HDVPS SLW 720 

Qy 721 KSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINF 780 

I I I I I I I II I I I I I I II I I I I : I I I I I I I I I I : : I I I I I I I I : I I I I I I I I I I I I : I I I 

Db 721 KSKLLVSYQEIPFYHIWNGTQQYLHCTFTLERINASTSDLACKVWVWQVEGDGQSFNINF 780 

Qy 781 NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 840 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I : I I I I I I I II I I I I I I I I I 

Db 781 NIT KDT RFAELLALES EGGVPALVGP S AFKI P FLI RQKI IAS LDP P CS RGADWRT LAQKL 840 

Qy 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I II I II I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 2 

US-09-306-902A-5 

; Sequence 5, Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES : 9 
; CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 

COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
; COMPUTER: IBM PC compatible 

; OPERATING SYSTEM: PC-DOS/MS-DOS 



SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 
FILING DATE: 07-May-1999 
; CLASSIFICATION: <Unknown> 

ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
; REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: UC96-217 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 
; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-306-902A-5 

Query Match 96.8%; Score 4638; DB 3; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 

Qy 1 MAVRPGLW PALLG I VLAAWL RGS GAQQ SAT VAN P VP GAN P DLL P H FLVE P E DVY I VKN K P 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAVRPGLW PVLLG I VLAAWL RGS GAQQ SAT VAN P VP GAN P DLL P H FLVE P E DVY I VKN K P 60 

Qy 61 VLLVCKAVPATQIFFKCNGEWVRQVDHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLE 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 VLLVCKAVPATQ I FFKCNGEWVRQVDHVI ERS T D S S S GL PTMEVRI NVS RQQVEKVFGLE 120 

Qy 121 EYWCQCVAWS S SGTTKSQKAYI RI ARLRKNFEQEPLAKEVSLEQGI VLPCRPPEGI PPAE 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 121 EYWCQCVAWS S SGTTKSQKAYI RI AYLRKNFEQEPLAKEVS LEQGI VLPCRP PEGI P PAE 180 

Qy 181 VEWLRNEDLVIDPSLDPNVYITREHSLVVRQARLADTANYTCVAKNIVA^ 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 VEWLRNEDLVT>PSLDPNWITREHSLVVRQARLADTANYTCVAKNIVARRRSTSAAVIW 24 0 

Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I 

Db 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

Qy 301 WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 360 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I : I I I I I I I 

Db 301 WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 360 

Qy 361 LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 420 

I I : I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 361 LYIGLVAVAVCLFLLLLALGLIYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 420 

Qy 421 TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 480 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I II I I I I I I I I I I I I : I I I 
Db 421 TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 480 



Qy. 481 RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHK 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I II | I I I I I I I I I I I I I I 
Db 481 RLSTQNYFRSLPRGTSNMAYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YEI YLTLHK 540 

Qy 541 PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

I I I II I I I I I I I I I I I I : I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 541 PEDVRLPLAGCQTLLSPWSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

Qy 601 EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 660 

I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 
Db 601 EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 660 

Qy 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

Qy 721 KSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDIACKLWVWQVEGDGQSFSINF 780 

I II II I I I II I I I I I I I I I 11:1 I II II III I:: I I I I I I I I : I I I I I I I I I I I I : I I I 
Db 721 KSKLLVSYQEIPFYHIWNGTQQYLHCTFTLERINASTSDLACKVWVWQVEGDGQSFNINF 780 

Qy 781 NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 840 

I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I 
Db 781 NITKDTRFAELLALESEGGVPALVGPSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKL 840 

Qy 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I I I I I I I I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 3 
US-08-808-982-6 

Sequence 6, Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE : CALI FORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 



CLASSIFICATION: 530 
ATTORNEY/ AGENT INFORMATION : 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE/DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
; INFORMATION FOR SEQ ID NO: 6: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: not relevant 

TOPOLOGY: not relevant 
; MOLECULE TYPE: peptide 
US-08-808-982-6 

Query Match 58.8%; Score 2815.5; DB 2; Length 557; 

Best Local Similarity 96.8%; Pred. No. 5.8e-259; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 



Qy 343 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 402 

I I I I I I I I : I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 NCTSDLXVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 60 

Qy 4 03 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 

Qy 463 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIP 522 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 121 RHTLHHS S PT S EAEE FVS RLSTQNYFRSLPRGTSNMTYGT FN FLGGRLMI PNTGISLLIP 180 

Qy 523 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEP 582 

I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I 
Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEP 240 

Qy 583 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 642 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300 

Qy 643 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFK 702 

I I I I I I I I I I II III I I I I I I I I I I III I I I I I I III II I I I I I I I II I I I I I I I I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 

Qy 703 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 762 

I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420 

Qy 763 KLWVWQVEGDGQS FS INFNITKDTRFAELLALES EAGVPALVGP SAFKI P FLI RQKI I S S 822 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I 
Db 421 KLWVWQVEGDGQ SFSINFNIT KDTRFAELLALE S EAGVPALVGP SAFKI P FL I RQKI I S S 480 

Qy 823 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 882 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540 



883 LGQPDAGLFT-VSEAEC 898 

I : I I I I I 
541 TXPAGRWLLSQCSEAEC 557 



RESULT 4 

US-09-306-902A-6 

; Sequence 6, Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; s Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
; STREET: 268 BUSH STREET, SUITE 3200 

; CITY: SAN FRANCISCO 

STATE: CALIFORNIA 
COUNTRY: USA 
; ZIP: 94104 

COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

; APPLICATION NUMBER: US/09/306, 902A 

; FILING DATE: 07-May-1999 

; CLASSIFICATION: <Unknown> 

; ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: not relevant 

; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-09-306-902A-6 

Query Match 58.8%; Score 2815.5; DB 3; Length 557; 

Best Local Similarity 96.8%; Pred. No. 5.8e-259; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 

Qy 343 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 402 

I I I I I I I I : I I I I I I I I I I I I II I I I I I I I I II I II I I I I I I I I I I I I I I I I I I II I I I 
Db 1 NCTSDLXVHTASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 60 



Qy 

Db 



Qy 403 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 

Qy 463 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 522 

I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I 
Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180 

Qy 523 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 582 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db . 181 PDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLS P I VS CGPPGVLLTRPVI LAMDHCGEP 240 

Qy 583 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 642 

I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300 

Qy 643 VAAAKRLK LL L FAP VACT S L E YN I RVYC LHDTHDALKEWQ LEKQ LGGQLIQEP RVLH FK 702 

I I II I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 

Qy 703 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 762 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II I I I I I I 
Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420 

Qy 763 KLWVWQVEGDGQS FS INFNI TKDTRFAELLALES EAGVPALVGP S AFKI P FLI RQKI I S S 822 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I 

Db 421 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 480 

Qy 823 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 882 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540 

Qy 883 LGQPDAGLFT-VSEAEC 898 

I : Mill 
Db 541 TXPAGRWLLSQCSEAEC 557 

RESULT 5 
US-08-808-982-7 

Sequence 7 , Application US/08808982 
Patent No. 5939271 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
APPLICANT: Leonardo, E. David 
APPLICANT: Hink, Lindsay 
APPLICANT: Masu, Masayuki 
APPLICANT: Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 



COMPUTER READABLE FORM: 
; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
; SOFTWARE: Patentln Release #1.0, Version #1.30 

CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/08/808,982 
FILING DATE: 
; CLASSIFICATION: 530 

ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 
; TELEFAX: (415) 343-4342 

; INFORMATION FOR SEQ ID NO: 7: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 943 amino acids 

; TYPE: amino acid 

; STRANDEDNESS: not relevant 

; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
US-08-808-982-7 

Query Match 53.7%; Score 2571.5; DB 2; Length 943; 

Best Local Similarity 53.3%; Pred. No. 2.5e-235; 

Matches 504; Conservative 142; Mismatches 221; Indels 79; Gaps 16; 

Qy 9 P ALLGI VLAAWL RG S GAQQ SAT VAN P VP GAN P DL L PH FL VE P EDVYI VKN K PVL LVC KAV 68 

1:111 I I I I : : II : I I I I I : I I I I I I I I I I I I I I : I 
Db 21 PSLAGI DSGAQ GLPDSFPSAPAEQLPHFLLEPEDAYIVKNKPVELHCRAF 70 

Qy 69 PATQIFFKCNGEV^QVT)HVIERSTDGSSGLPTMEvT^INVSRQQV^KVFGLEEYWCQCVA 128 

11111:11111111 I II : I I : : I I I I : I I I I I I I I :: I I I I : I I I I I I I 
Db 71 PATQIYFKCNGEWVSQKGHVTQESLDEATGLRIREVQIEVSRQQVEELFGLEDYWCQCVA 130 

Qy 129 WSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNED 188 

I I I I I I I I I :: I I I I I I I I I I I : I I I I I I I I I : :: I I I I I I I : I I I I I I I : I I I 

Db 131 WSSSGTTKSRRAYIRIAYLRKNFDQEPLAKEVPLDHEVLLQCRPPEGVPVAEVEWLKNED 190 

Qy 189 LVD P S LD PNVY I T REH S LWRQARLADTAN YT CVAKN I VARRRS AS AAVI VYVN GGWS TW 248 

: : I I : I I : I : I : I : : I I I I I : I I I I I I I I I I I I I I : I I I : I I I I I I I I I I I : I 
Db 191 VI DPAQDTNFLLT I DHNLI I RQARLS DTAN YTCVAKNI VAKRRSTTATVI VYVNGGWS SW 250 

Qy 249 TEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWS 308 

III II I I I I I I I I : I : I II I I I I I I I I I I I I I I I I I I I : I I I I I : I : I I I I I 
Db 251 AEWSPCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQACQKTACTTVCPVDGAWTEWSKWS 310 

Qy 309 ACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV HSASGPE 357 

II : I I I I I I I I I I : I I I : I II I I :: I I I III : : I : 

Db 311 ACSTECAHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCVLNQRTLNDPKSRPLEPSG 370 

Qy 358 DVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDVADSS-ILTSGFQPVSIKPSKAD 415 

I I I I I I I : II I I : I : I : : I I I I : I : I I I I I I I I I : I :: 

Db 371 DVAL YAGL VVAVFVVLAVTiMAVGVI VYRRNCRDFDTD I T D S S AALTGGFH PVN FKTARP S 430 



Qy 


416 


NPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPL 

II II : III:: 1 : 1 : II : 1 : 1 1 1 1 1 1 

NPQLLHPSAPPDLTASAGIYRGPVYALQDS-ADKI PMTNSPLLDPLPSLKIKVYDSSTIG 


459 


Db 


431 


489 


Qy 


460 


-GGG RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGT 

II 1 1 1 1 : | : | : | | | 
SGAGLADGADLLGVLPPGTYPGDFSRDTHFLHLRS ASLGSQ-HLLGLPRDP 


495 


Db 


490 


539 


Qy 


496 


SNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTL 

1 : III 1 1 1 1 1 II 1 1 : 1 1 1 : 1 1 1 1 : 1 1 1 : : 1 1 : : 1 1 1 1 1 : 1 1 1 : 
SSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTEST-LPLSEGSQTV 


554 


Db 


540 


598 


Qy 


555 


LSPIVSCGPPGVLLTRPVT LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHL 

1 1 1 1 : 1 1 1 1 : 1 1 1 1 1 : 1 : 1 1 1 1 : 1 1 1 : : 1 1 1 : 1 : 1 1 1 : 
LSPSVTCGPTGLLLCRPWLTVPHCT^EVIAGDWIFQLKTQAHQGHWEEWTLDEETLNTP 


614 


Db 


599 


658 


Qy 

Db 


615 
659 


YYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDT 

1 1 II 1 1 : 1 : : : 1 1 1 : 1 1 : 1 : 1 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 : : 1 1 1 1 1 1 1 
CYCQLEAKSCHILLDQLGTYVFTGESYSRSAVKRLQLAIFAPALCTSLEYSLRVYCLEDT 


674 
718 


Qy 

Db 


675 
719 


HDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFY 

1 1 1 1 1 : : 1 1 : III 1 : : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 : 1 : 1 : 1 1 1 1 1 1 1 1 1 1 1 
PAALKEVLELERTLGGYLVEEPKTLLFKDS YHNLRLSLHDI PHAHWRSKLLAKYQEI PFY 


734 
778 


Qy 


735 


HIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLAL 

1 : 1 1 1 : 1 : 1 1 1 1 1 1 1 1 1 1 : : : : 1 1 : 1 1 1 1 1 : 1 1 1 : : : : 1 III 
HVWNGSQKALHCTFTLERHSLASTEFTCKVCVRQVEGEGQIFQLHTTLA-ETPAGSLDAL 


794 


Db 


779 


837 


Qy 


795 


ESEAGVPAL— VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASK 
1 1 1 : 1 1 1 1 1 1 1 11111:1111 1 1 1 1 1 1 1 1 1 1 : 1 : 1 :: 1 1 : 1 
CSAPGNAATTQLGPYAFKIPLSIRQKICNSLDAPNSRGNDWRLix^ K 


852 


Db 


o o o 

boo 




Qy 


853 


PSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

111:11:11111 : | : | : | | : | : : I : : : : : : I 
ASPTGVILDLWEARQQDDGDLNSLASALEEMGKSEMLVAMTTDGDC 943 




Db 


898 





RESULT 6 

US-09-306-902A-7 

; Sequence 7 , Application US/09306902A 
; Patent No. 6277585 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo , E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 9 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 



; MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/306, 902A 
; FILING DATE: 07-May-1999 

CLASSIFICATION: <Unknown> 
ATTORNEY/AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
; REFERENCE/ DOCKET NUMBER: UC96-217 

TELECOMMUNICATION INFORMATION: 
; TELEPHONE: (415) 343-4341 

; TELEFAX: (415) 343-4342 

; INFORMATION FOR SEQ ID NO: 7: 

SEQUENCE CHARACTERISTICS: 
; LENGTH: 943 amino acids 

; TYPE: amino acid 

STRANDEDNESS: not relevant 
; TOPOLOGY: not relevant 

MOLECULE TYPE: peptide 
; SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

US-09-306-902A-7 

Query Match 53.7%; Score 2571.5; DB 3; Length 943; 

Best Local Similarity 53.3%; Pred. No. 2.5e-235; 

Matches 504; Conservative 142; Mismatches 221; Indels 79; Gaps 16; 

Qy 9 PALLGI V1AAWLRGSGAQQSATVANPVPGANPDLLPHFLVTSPEDW 68 

I : I I I I I I I : : II : I I I I I : I I I I I I I I I I I I I I : I 
Db 21 PSLAGI DSGAQ GLPDSFPSAPAEQLPHFLLEPEDAYIVKNKPVELHCRAF 70 

Qy 69 PATQIFFKCNGEWVRQVDHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQCVA 128 

I I I I I : I I I I I I I I I II : I I : : I I I I : I I I I I I I I : : I I I I : I I I I I I I 
Db 71 PATQIYFKCNGEWVSQKGHVTQESLDEATGLRIREVQIEVSRQQVEELFGLEDYWCQCVA 130 

Qy 129 WSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLRNED 188 

I I I I I I I I I :: I I I I I I I I I I I : I I I I I I I I I : :: I I I I I I I : I I I I I I I : I I I 

Db 131 WSSSGTTKSRRAYIRIAYLRKNFDQEPLAKEVPLDHEVLLQCRPPEGVPVAEVEWLKNED 190 

Qy 189 LVT)PSLDPNVTITREHSLVVRQARLADTANYTCVAXNIVARRRSASAA^^ 248 

: : I I : I I : I : I : I : : I I I I I : I I I I I III 111111 = 111 : I I I I I I I II I I : I 
Db 191 VIDPAQDTNFLLTIDHNLIIRQARLSDTANYTCVAKNIVAKRRSTTATVIVYVNGGWSSW 250 

Qy 249 TEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWS 308 

I I I I I I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I : I I II I : I : I I I I I 
Db 251 AEWSPCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQACQKTACTTVCPVDGAWTEWSKWS 310 

Qy 309 ACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV HSASGPE 357 

II : I I I I I I I I I I : I I I : I I I I I :: II I III : : I : 

Db 311 ACSTECAHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCVLNQRTLNDPKSRPLEPSG 370 

Qy 358 DVALYVGL-IAVAVCLVLLLLVLILVYCRKKEGLDSDVADSS-ILTSGFQPVSIKPSKAD 415 

I M I I I I : I I I I : I : I :: I I I I : I : I I I I I I I I I : I : : 

Db 371 DVALYAGLWAVFVA^VXMAVGVIWRRNCRDFDTDITDSSAALTGGFHPWFKTARPS 430 



Qy 

Db 



416 NPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPL 459 

I I II : III:: I : I : II : I : I I I I I I 
431 NPQLLHPSAPPDLTASAGIYRGPVYALQDS-ADKIPMTNSPLLDPLPSLKIKVYDSSTIG 489 



Qy 460 -GGG RHTLHHSSPTSEAEEFVS RLSTQNYFRSLPRGT 495 

II I I I I : I : I : I I I 

Db 490 SGAGLADGADLLGVLPPGTYPGDFSRDTHFLHLRS ASLGSQ-HLLGLPRDP 539 

Qy 496 SNNTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-GCQTL 554 

I: Ml I I I II I I I I : I I I : I I I I : I I I : : I I : : I I III: I I I : 
Db 540 SSSVSGTFGCLGGRLTIPGTGVSLLVPNGAIPQGKFYDLYLRINKTEST-LPLSEGSQTV 598 

Qy 555 LSPIVSCGPPGVXLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHL 614 

I I I I : I I I I : II I I I : I : I I I I : I I I : : I I I : I : I I I : 

Db 599 LSPSVTCGPTGLLLCRPVVLTVPHCAEVIAGDWIFQLKTQAHQGHWEEVVTLDEETLNTP 658 

Qy 615 YYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDT 674 

I I I I II : I :: : I I I : I I : I : I I I I : I : I I I I I I I I I :: I I I I I I I 
Db 659 CYCQLEAKSCHILLDQLGTYVFTGESYSRSAVKRLQLAIFAPALCTSLEYSLRVYCLEDT 718 

Qy 675 HDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFY 734 

I I I I I : : I I : II I I : : I I : I I I I I I I I I I I I : I I : I : I : I I I I I I I II I I 
Db 719 PAALKEVLELERTLGGYLVEEPKTLLFKDSYHNLRLSLHDIPHAHWRSKLLAKYQEIPFY 778 

Qy 735 HIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLAL 794 

I : I I I : I : I I I I I I I I I I : : : : I I : I I I I I : I I I : : : : I III 
Db 779 HVWNGSQKALHCTFTLERHSLASTEFTCKVCVRQVEGEGQIFQLHTTLA-ETPAGSLD^ 837 

Qy 795 E S EAGVPAL — VG P S AFKI P FL I RQ KIISSLDP P CRRGADWRT LAQKLHL D S H L S FFAS K 852 

I I I : I I I I I I I 11111:1111 I I I I I I I I I I : I . : I : : I I : I 
Db 838 CSAPGNAATTQLGPYAFKIPLSIRQKICNSLDAPNSRGNDWRLLAQKLSMDRYLNYFATK 897 

Qy 853 PSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I II : I I : I I I I I : I : I : I I : I : : I : : : :: : I 
Db 898 ASPTGVILDLWEARQQDDGDLNSLASALEEMGKSEMLVAMTTDGDC 943 



RESULT 7 

US-09-949-016-10665 

; Sequence 10665, Application US/09949016 

; Patent No. 6812339 

; GENERAL INFORMATION: 

; APPLICANT: VENTER, J. Craig et al. 

; TITLE OF INVENTION: POLYMORPHISMS IN KNOWN GENES ASSOCIATED 

; TITLE OF INVENTION: WITH HUMAN DISEASE, METHODS OF DETECTION AND USES 

THEREOF . . 

; FILE REFERENCE: CL001307 

; CURRENT APPLICATION NUMBER: US/09/94 9,016 

; CURRENT FILING DATE:" 2000-04-14 

; PRIOR APPLICATION NUMBER: 60/241,755 

; PRIOR FILING DATE: 2000-10-20 

; PRIOR APPLICATION NUMBER: 60/237,768 

; PRIOR FILING DATE: 2000-10-03 

; PRIOR APPLICATION NUMBER: 60/231,498 

; PRIOR FILING DATE: 2000-09-08 

; NUMBER OF SEQ ID NOS : 207012 

; SOFTWARE: FastSEQ for Windows Version 4.0 



SEQ ID NO 10665 
LENGTH: 769 
TYPE: PRT 
ORGANISM: Human 
US-09-949-016-10665 

Query Match 48.8%; Score 2337.5; DB 4; Length 769; 

Best Local Similarity 56.1%; Pred. No. 3.4e-213; 

Matches 432; Conservative 136; Mismatches 179; Indels 23; Gaps 7; 

Qy 147 LRKNFEQEPLAKEVSLEQGIVTiPCRPPEGIPPAEVEWLRNEDLVDPSLDPNVyiTREHSL 206 

III I I I I I I I I I II I I : : I I I I II I I I I I I I I I : I I I :: I I I I I I I : I : I 
Db 3 LRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLKNEDIIDPVEDRNFYITIDHNL 62 

Qy 207 VVT^QARLADTANYTCVAKNIVARR 266 

: : : II I I : I I I I II I I I I I I I I : I : I : I I I I I I I I I I I I I I I I I I I : : I I I I : I I I : 
Db 63 IIKQARLSDTANYTCVAKNIVAKRKSTTATVI VTVNGGWSTWTEWSVCNSRCGRGYQKRT 122 

Qy 267 RSCTNPAPLNGGAFCEGQNVQKTACATLCPVT)GSWSPWSKWSACGLDCTHWRSRECSDPA 326 

I : I I I I I I I I I I I I I I I I : I I I II I I I I I I I I : I I I I I I II : I I I I I III: II 
Db 123 RTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTPWSKWSTCGTECTHWRRRECTAPA 182 

Qy 327 PRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL-IAVAVCLVLLLLVLILVYCR 385 

I : I I I : : I I I : : I I I I I : : I : I I I I I I I : I I I I I I : : : I : I I : 
Db 183 PKNGGKDCDGLVLQSKNCTDGLCMQTAPDSDDVALYVGIVIAVIVCLAISWVALFVYRK 242 

Qy 386 KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPS 445 

: I I : I I I I I I I I I : I I : : I I I : I I I : : I : I : I I 
Db 243 NHRDFESDIIDSSALNGGFQPVNIKAARQD LLAVPPDLTSAAAMYRGPVYALHD-VS 298 

Qy 446 PKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVSRLS TQNYF R 489 

I : I I : I I I :::::! I : I I I : I I I I : : 

Db 299 DKIPMTNSPILDPLPNLKIKVTNTSGAVTPQDDLSEFTSKLSPQMTQSLLENEALSLKNQ 358 

Qy 490 SLPRGT— SNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLP 547 

I I I I I : I : I I I I I I : : I I : I : I I I I I I I I : I : : I I : I : I : I : I : I I 

Db 359 SLARQTDPSCTAFGSFNSLGGHLIVPNSGVSLLIPAGAIPQGRVYEMYVTVHRKETMRPP 418 

Qy 54 8 LAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLG 607 

: I I I I : I : I I I I I I I I I I I I I : I I II : I : : I : II I : : I MM: : | 
Db 419 MDDSQTLLTPWSCGPPGALLTRPVVXTMHHCADPNTEDWKILLKNQAAQGQWEDWWG 478 

Qy 608 EEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIR 667 

II : I I I : I I I :: I I I : II M : : M I I I M I : I I : I : I II I : I I 

Db 479 EENFTTPCYIQLDAEACHILTENLSTYALVGHSTTKAAAKRLKLAI FGPLCCSSLEYSIR 538 

Qy 668- VYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVS 727 

I I I I I I I I I I I :: I I : I : I I I I :: II : I I I I I I I I I I I I I I : I II II II I 
Db 539 VYCLDDTQDALKEILHLERQMGGQLLEEPKALHFKGSTHNLRLSIHDIAHSLWKSKLLAK 598 

Qy 728 YQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTR 787 

I I I I I I I I : I : I : II I II I II I I I I : I : I III I Nihil I : I : : :: 
Db 599 YQEIPFYHVWSGSQRNLHCTFTLERFSLNTVELVCKLCVRQVEGEGQIFQLNCTVSEEPT 658 

Qy 788 FAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLS 847 

: I I : : : M II I II I I II : I I I I I I I I I I I I I I : II : I : 
Db 659 GIDLPLLDPANTITTVTGPSAFSIPLPIRQKLCSSLDAPQTRGHDWRMLAHKLNLDRYLN 718 



Qy 848 FFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 897 

: I I : I I I I : I I : II I I : : I I : I I I I I I I : : I : : : : I : 
Db 719 YFATKSS PTGVI LDLWEAQNFPDGNLSMLAAVIiEEMGRHETWSLAAEGQ 768 



RESULT 8 

US-09-969-532-16 

; Sequence 16, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 

Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 16 

LENGTH: 886 

TYPE: PRT 

ORGANISM: homo sapiens 
US-09-969-532-16 

Query Match 39.6%; Score 1895; DB 4; Length 886; 

Best Local Similarity 41.2%; Pred. No. 6.1e-171; 

Matches 383; Conservative 157; Mismatches 277; Indels 112; Gaps 14; 

Qy 8 WPALLGI VLAAWLRGS GAQQ SAT VAN P VP GAN P D L L PH FL VE P EDVY I VKN K P VL L 63 

I Mill: I : : I I I I I I I : I I : I I I : I : I : I 

Db 17 WLPWLG — LCFWAAGTAAARGTDNGEALPESIPSA-PGTLPHFIEEPDDAYIIKSNPIAL 73 

Qy 64 VCKAVPATQIFFKCNGEWVT^QVT>ffVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYW 123 

I I I I I I I I II I I II I I I : I I I : I I I I I 11111:11111 I I : I I 
Db 74 RCKARPAMQIFFKCNGEWVT1QNEHVSEETLDESSGLKWEVFINVTRQQVEDFHGPEDYW 133 

Qy 124 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEW 183 

II I I I I I I I : I I : I I : I I I I I I I I I I : I : I I : I III I I I I I I : I Mill 

Db 134 CQCVAW S HLGT S K S RKAS VRI AYL RKN FEQD P Q GRE VP I EGMI VLHC RP P EGVPAAEVEW 193 

Qy 184 LRNEDLVT)PSLDPNvTITREHSLVVRQARLADTANYTCV^ 243 

I : M : : I II: : I : I : : I I I I I : I : I I I I : I I I I I : I I I I I- I : I I I : I 
Db 194 LKNEEPIDSEQDENIDTRADHNLIIRQARLSDSGNYTCMAANIVAKRRSLSATVVVYVDG 253 

Qy 244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSP 303 

IIMIIMI 

Db -254 SWEVWSEWSVCSP 266 

Qy 304 WSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYV 363 

: I I I I I I : I I II I I : I : I : : I I I I I : Mill 
Db 267 ECEHLRIRECTAPPPRNGGKFCEGLSQESENCTDGLCILGIENASDIALYS 317 



QY 



364 GLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL — T 421 



II II I : I : : : : I I : I I I I I I I I I I : I : I II 

Db 318 GL- GAAWAVAVLVI GVT L YRRS Q S D YGVDVI D S SALT GG FQT FN FKT VRQ GN S LLLNS A 376 

Qy 422 IQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG 460 

: I I I I I : I I I : I Ml I : I I : I I 
Db 377 MQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAE 433 

Qy 461 GGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGG 508 

II: I :: :: I I I I I I I I I I I I 

Db 434 YHGKNHSRTFPHGNNHSFSTMHPRNKM-PYIQNLS SLPTRTELRTTGVFGHLGG 486 

Qy 509 RLMI PNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLL 568 

I I :: I I I I : I I I I I III : I II : : : : : I I I : I I I I I : I I I I : : : 

Db 487 RLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIV 545 

Qy 569 TRPVI LAMDHCGEPS PDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFT 628 

II I : I I : I : I : : I I I : : : I I I : I : : : I : | I I I : I I : I 

Db 546 TTPFALTIPHCADVSSEHWNIHLKKRTQQGKWEEVMSVEDESTS — CYCLLDPFACHVLL 603 

Qy 629 EQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQL 688 

: I : I I I I : : I I : I I : : I : : I I I : I I : I I I I : : I I : I I I I : 
Db 604 D S FGT YALT GE P I T DCAVKQ L KVAVFGCMS CN S L D YNLRVYCVDNT P CAFQ EWS DE RHQ 663 

Qy 689 GGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTF 748 

I I I I : : I I : : I I I I : : I : : I : I : I I I : I : I I : I I : I : : I I I I 
Db 664 GGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAF 723 

Qy 749 TLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSA 808 

: I I I : I : I : I : I I : : I :: I I : : I : I : : : . I I III 

Db 724 SLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESERETITFFAQEDSTFPAQTGPKA 783 

Qy 809 FKI PFLIRQKI I S SLDPPCRRGADWRTLAQKLHLDSHLS FFASKPS PTAMI LNLWEARHF 868 

I I I I : I I I : I : : I I : I I I : I I I I : : : I I : I I : : I I : I : I I I I I I I I I 
Db 784 FKI P YSI RQRI CATFDTPNAKGKDWQMLAQKNSINRNLS YFATQSS PSAVI LNLWEARHQ 843 

Qy 869 PNGNLSQLAAAVAGLGQPDAGLFTVSEAE 897 

: I : I III: : I : I : I I : : 
Db 844 HDGDLDSLACALEEIGRTHTKLSNISESQ 872 

RESULT 9 

US-09-969-532-14 

Sequence 14, Application US/09969532 
Patent No. 6777232 
GENERAL INFORMATION: 
APPLICANT: Walke, D. Wade 
APPLICANT: Scoville, John 

TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

FILE REFERENCE: LEX-0244-USA 
CURRENT APPLICATION NUMBER: US/09/969, 532 
CURRENT FILING DATE: 2001-10-02 
PRIOR APPLICATION NUMBER: US 60/237,280 
PRIOR FILING DATE: 2000-10-02 
NUMBER OF SEQ ID NOS : 33 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 14 



LENGTH: 897 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-14 



Query Match 39.4%; Score 1886.5; DB 4; Length 897; 

Best Local Similarity 40.9%; Pred. No. 4e-170; 

Matches 384; Conservative 157; Mismatches 276; Indels 123; Gaps 15; 

Qy 8 WPALLGIVLAAWLRGS GAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLL 63 

I II I I I : I : : I I I I I I I : I I : I I I : I : I : I 

Db 17 WLPWLG — LCFWAAGTAAARGTDNGEALPES I PSA-PGTLPHFI EEPDDAYI IKSNPIAL 73 

Qy 64 VCKAVPATQI FFKCNGEWVRQVDHVT ERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYW 123 

III II I I I I I I I I I I I I : I I I : I I I I I II 111:11111 I I : I I 
Db 74 RCKARPAMQIFFKCNGEWVTiQNEHVSEETLDESSGLKVT^EW 133 



Qy 124 CQCVAWS SSGTTKSQKAYI RIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGI PPAEVEW 183 

I I I I I I I Ihlhll : I I I I I I I I I I : I : I I : I III I I I I I I : I I I I I I 
Db 134 CQCVAW S H LGT S K S RKAS VRI AYLRKN FEQ D P QGRE VP I EGMI VLH CRP P EGVPAAEVEW 193 

Qy 184 LRNEDLVTDPSLDPNVTITREHSLVVRQARLADTAN 243 

I : I I : : I I I : : I : I : : I I I I I : I : I I I I : I I I I I : I I I II I : I I I : I 
Db 194 LKNEEPI DSEQDENI DTRADHNLI I RQARLSDSGNYTCMAANI VAKRRSLSAT WVYVDG 253 

Qy 244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSP 303 

I I : I I I I I I 

Db 254 SWEVWSEWSVCSP 266 



Qy 304 WSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV HS 352 

: I I I I I I : I I I I I I : I : I : : I I I I I : I 
Db 267 ECEHLRIRECTAPPPRNGGKFCEGLSQESENCTDGLCILDKKPLHEIKPQS 317 

Qy 353 AS GP EDVAL YVGL I AVAVCLVLLLLVL I LVYCRKKEGLD S DVAD S S I LT S GFQ PVS I KP S 412 

I : I I I I I II I : I : : : : I I : I I I I I I I I I I : I 

Db 318 I EN AS DIAL YS GL- GAAVVAVAVLVI GVT L YRRS Q S D YGVDVI D S SALT GGFQT FN FKTV 37 6 



Qy 413 KADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG 460 

: I II : I I I I I : I I I : I III I : I I : I I 

Db 377 RQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLFNPLSDIKVKVQSSF 433 

Qy 461 GGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSN 497 

II: I : : : : II III I 

Db 434 MVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPRNKM-PYIQNLS SLPTRTEL 486 

Qy 498 MTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTLLSP 557 

III I I I I !::| I I 1:1 I I I I III :|||::::: | | I : I I II 

Db 487 RTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSP 545 

Qy 558 IVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYC 617 

I : I II I : : : I I I : I I : I : I : : I I I : : : I I I : I : : : I : I II 

Db 546 EWCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQGKWEEVMSVEDESTS — CYC 603 



Qy 

Db 



618 QLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDA 677 

I : I I : I : I : I I I I : : I I : I I : : I : : I I I : I I : I I I I : : I I 
604 LLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLDYNLRVYCVDNTPCA 663 



Qy 678 LKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIW 737 

: I I I I : I I I I : : I I : : I I I I : : I : : I : I : I I I : I : I I : I I : I 
Db 664 FQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIKPFTACQEVPFSRVW 723 

Qy 738 NGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESE 797 



Db 724 CSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESERETITFFAQED 783 

Qy 798 AGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTA 857 

: II I I I I I I I : I I I : I :: I I :| II: MM : : : I I : I I : : I I : I 
Db 784 ST FPAQTGPKAFKI P YS I RQRI CAT FDT PNAKGKDWQMLAQKNS INRNLS YFATQS S PSA 843 

Qy 858 MI LNLWEARH FPNGNLSQLAAAVAGLGQ P DAGL FTVS EAE 897 

: I I II I II I I : I : I MM : M I MM: 
Db 844 VILNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 883 



RESULT 10 
US-09-969-532-12 

; Sequence 12, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D.-Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 12 
; LENGTH: 900 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-12 

Query Match 39.2%; Score 1878; DB 4; Length 900; 

Best Local Similarity 40.6%; Pred. No. 2.6e-169; 

Matches 383; Conservative 157; Mismatches 277; Indels 126; Gaps 15; 

Qy 8 WPALLGIVLAAWLRGS GAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLL 63 

I II I I I : I : MM II II : II : I I I : I : M I 

Db 17 WLPWLG — LCFWAAGTAAARGTDNGEALPESIPSA-PGTLPHFIEEPDDAYIIKSNPIAL 73 

Qy 64 VCKAVPATQIFFKCNGEWV^QVT)HVIERSTDGSSGLPTMEWINVSRQQVEKVFGLEEYW 123 

I I I II II I M I M II I I : I I I : I I M I II II I : I M II I I : I I 
Db 74 RCKARPAMQIFFKCNGEWVHQNEHVSEETLDESSGLKVREVFINVTRQQVEDFHGPEDYW 133 

Qy 124 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAEVEW 183 

II II II I I M II : I I MM M I II II : I Ml M III I I II II M Mill 

Db 134 CQCVAWSHLGTSKSRKASVRIAYLRKNFEQDPQGREVPIEGMIVXHCRPPEGVPAAEVEW 193 

Qy 184 LRNEDLVT)PSLDPNVTITREHSLWRQAiUiADTANYT^ 243 

MM: :| I I: : | : | : : | | I M : I : II II M I I II : I II II I : M I : I 



Db 194 LKNEEP I DS EQDEN I DTRADHNLI I RQARLS DS GN YTCMAANI VAKRRS LSATVVVYVDG 253 

Qy 244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSP 303 

I I : I I I I I I 

Db 254 SWEVWSEWSVCSP 266 

Qy 304 WSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYV 363 

: I I I I I I : I I I I I I : I : I : : I I I I I : hill 
Db 267 ECEHLRIRECTAPPPRNGGKFCEGLSQESENCTDGLCILGIENASDIALYS 317 

Qy 364 GL I AVAVCLVLL LLVL I LVYC RKKE GL D S DVAD S S I LT S G FQ P VS I K P S KA 414 

II I I I : I : : : : I I : I I I I I I I I I I : I : 

Db 318 GL- GAAWAVAVLVI GVTL YRRSQS D YGVDVI DS S ALT GGFQT FN FKTVRQAKN IMELMI 376 

Qy 415 DNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG 460 

III : I I I I I : I I I : I I I I I : I I : I I 
Db 377 QEKSFGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKEIJOTESSLFNPLSDIKVKVQ 433 

Qy 461 GGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRG 494 

I I : I : : : : I I III 
Db 434 S S FMVSLGVSERAEYHGKNHSRTFPHGNNHS FSTMHPRNKM- PYIQNLS SLPTR 486 

Qy 495 TSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLAGCQTL 554 

I III I I I I I : : I I I I : I I I I I III : I I I : : : : : I I I : I 
Db 487 TELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVL 545 

Qy 555 LSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHL 614 

I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I I I : I : : : I : I 
Db 546 LSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQGKWEEVMSVEDESTS — 603 

Qy 615 YYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDT 674 

Ml: I I : I : I : I I I I : : I I : I I : : I : : I I I : I I : I I I I : : I 
Db 604 CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLDYNLRWCVDNT 663 

Qy 675 HDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFY 734 

I : I I I I : I I I I : : I I : : I I I I : : I : : I : I : I I I : I : I I : I I 
Db 664 PCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIKPFTACQEVPFS 723 

Qy 735 HIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLAL 794 

: I : : I I I I : I I I : I : I : I : I I : : I : : I I : : I : I 
Db 724 RVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESERETITFFA 783 

Qy 795 E S EAGVPALVG P S AFK I P FL I RQK 1 1 S S L D P P C RRGADW RT LAQKLH LDSHL S FFAS KP S 854 

: : : II I I I I I I I : I I I : I :: I I : I I I : I I I I : : : I I : I I : : I 
Db 784 QEDST FPAQTGPKAFKI P YS I RQRI CAT FDT PNAKGKDWQMLAQKNS INRNLS YFATQS S 843 

Qy 855 PTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 897 

I : I : I I I I I II I I : I : I III: : I : I : I I : : 
Db 844 PS AVI LNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 886 



RESULT 11 
US-09-969-532-10 

; Sequence 10, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 



; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 

Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 10 

LENGTH: 911 
; TYPE: PRT 

; ORGANISM: homo sapiens 
US-09-969-532-10 

Query Match 39.0%; Score 1869.5; DB 4; Length 911; 

Best Local Similarity 40.3%; Pred. No. 1.7e-168; 

Matches 384; Conservative 157; Mismatches 276; Indels 137; Gaps 16; 

Qy 8 WPALLGIVLAAWLRGS GAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLL 63 

I II I I I : I : : I I I I I I I : I I : I I f : I : I : I 

Db 17 WLPWLG— LCFWAAGTAAARGTDNGEALPESIPSA-PGTLPHFIEEPDDAYIIKSNPIAL 73 

Qy 64 VCKAVPATQIFFKCNGEWVl^QvT)HVIERSTDGSSGLPTMEvT^INVSRQQVEKVFGLEEYW 123 

I I I I I I I I I I I I I I I I I : I I I : I I I I I 11111:11111 I I : I I 
Db 74 RCKARPAMQIFFKCNGEWVHQNEHVSEETLDESSGLKWEVFINVTRQQVEDFHGPEDYW 133 

Qy 124 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEW 183 

I I I I I I I I I : I I : I I : I I I I I I I I I I : I : I I : I I I I I I I I I I : I I I I I I 
Db 134 CQCVAWSHLGTSKSRKASVT^IAYLRKNFEQDPQGREVPIEGMIVXHCRPPEGVPAAEVEW 193 

Qy 184 LRNEDLVDPSLDPNWITREHSLWRQARIADTANYTCVAKNIV^ 243 

I : I I : : I I I : : I : I : : I I I I I : I : I I I I : I I I I I : I I I I I I : I I I : I 
Db 194 LKNEEP I DS EQDENI DTRADHNLI I RQARLS DS GN YTCMAANI VAICRRSLSATVWYVDG 253 

Qy 244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSP 303 

11:111111 

Db 254 SWEVWSEWSVCSP 266 

Qy 304 WSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV HS 352 

: I I I I I I : I I I I I I : I : I : : I I I I I : I 
Db 267 ECEHLRI RECTAP P PRNGGKFCEGLSQES ENCTDGLCI LDKKPLHEI KPQS 317 

Qy 353 ASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPS 412 

I : I I I I I II I : I : : : : I I : I I I I I I I I I I : I 

Db 318 IENASDIALYSGL-GT^WAVAVLVIGVTLYRRSQSDYGVDVIDSSALTGGFQTFNFKTV 376 

Qy 413 KA DNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLL 456 

: III : I I I I I : I I I : I II I I : I I 

Db 377 RQAKNIMELMIQEKSFGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLF 433 

Qy 457 SPLG GGRHTLHHSSPTSEAEEFVSRLS 483 

: I I II: | : : : : | | 

Db 434 NPLSDI KVKVQS S FMVSLGVSERAEYHGKNHS RTFPHGNNHS FSTMHPRNKM- PYIQNLS 492 



Qy 



484 TQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPED 543 



Ill I III I I I I I I I I I : I I I I I III : I I I : : : : : I 
Db 493 SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLI PHGAI PEENSWEI YMS INQGEP 546 

Qy 544 VRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDV 603 

I I : I I I I I : I I I I •: : : I I I : I I : I : I : : I I ! : : : I I I : I 
Db 547 -SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRTQQGKWEEV 605 

Qy 604 LHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLE 663 

: : : I : I III: I I : I : I : I I I I : : I I : I I : : I : : I I I : 
Db 606 MSVEDESTS— CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLD 663 

Qy 664 YNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSK 723 

I I : I I I I : : I I : I I I I : I I I I : : I I : : I I I I : : I : : I : I : I I I : I 
Db 664 YNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIK 723 

Qy 724 LLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNIT 783 

: I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I I : : I 
Db 724 PFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSIL 783 

Qy 784 KDTRFAELIALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRG7VDWRTLAQKLHLD 843 

: I : : : II I I I I I I I : I I I : I : : I I : I I I : I I I I : : 

Db 784 ESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKDWQMLAQKNSIN 843 

Qy 844 SHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 897 

: I I : I I : : I I : I : I I I I I I I I I : I : I III: : I : I : I I : : 
Db 844 RNLSYFATQSSPSAVI LNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 897 



RESULT 12 
US-09-969-532-32 

; Sequence 32, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 32 

LENGTH: 655 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-32 

Query Match 26.3%; Score 1259.5; DB 4; Length 655; 

Best Local Similarity 36.8%; Pred. No. 1.3e-110; 

Matches 262; Conservative 128; Mismatches 216; Indels 105; Gaps 11; 



Qy 

Db 



222 VAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFC 281 

: I 1111:111 II I : I I I : I I I : I I I I I I 
1 MAANIVAKRRSLSATVWYVDGSWEVWSEWSVCSP 35 



Qy 282 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 341 

: I I I I I I : I I II I I : I : I :: 
Db 36 ECEHLRIRECTAPPPRNGGKFCEGLSQES 64 

Qy 342 RNCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILT 401 

III II: 1:11111 II | : | : : : : I | : I I I I I I I 

Db 65 EN CT D GL C I LGI EN AS D I AL Y S GL- GAAWAVAVLVI GVT L YRRS QS D YGVDVI D S SALT 123 

Qy 402 SGFQPVSIKPSKADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPL 459 

I I I : I : I I I : I I II I : I I I : I I I I I : I I : I I 
Db 124 GGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLFNPL 180 

Qy 460 G GGRHTLHHSSPTSEAEEFVSRLSTQN 486 

I I : I : : : : II 
Db 181 SDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPRNKM-PYIQNLS 236 

Qy 487 YFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRL 546 

III I III II I I I :: I I I I : I I I I I III : I I I : : : : : I I 
Db 237 SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SL 292 

Qy 547 PLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHL 606 

I : I I I I I : I I I I : : : I I I : I I : I : I : : I I I : : : I I I : I : : 
Db 293 QSDGSEVLLSPEWCGPPDMIWTPFALTIPHCADVSSEHWNIHLKKRTQQGKWEEVMSV 352 

Qy 607 GEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNI 666 

: I : I Ml: I I : I : I : I I U : : I I : I I : : I : : I I I : I I : 
Db 353 EDESTS — CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLDYNL 410 

Qy 667 RVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLV 726 

I I I I : : I I : I I I I : I I I I : : I I : : I I I I : : I : : I : I : I I I : I 
Db 411 RVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIKPFT 470 

Qy 727 SYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDT 786 

: I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I I : : I : 
Db 471 ACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESE 530 

Qy 787 RFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHL 846 

I : : : I I I I I I I I I : I I I : I : : I I : I I I : I I I I :: : I 

Db 531 RETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKDWQMLAQKNSINRNL 590 

Qy 847 S F FAS K P S P TAMI LN LWEARH F PN GN L S Q LAAAVAGLGQ P D AGL FT VS EAE 897 

I : I I :: II : I : I I I I I I I I I :|:| III: :|: I :||:: 
Db 591 SYFATQSSPSAVI LNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 641 



RESULT 13 
US-09-969-532-30 

; Sequence 30, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 

Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/09/969,532 



; CURRENT FILING DATE: 2001-10-02 

;. PRIOR APPLICATION NUMBER: . US 60/237,280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: Fast SEQ for Windows Version 4.0 
; SEQ ID NO 30 

LENGTH: 666 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-30 



Query Match 26.1%; Score 1251; DB 4; Length 666; 

Best Local Similarity 36.4%; Pred. No. 8.5e-110; 

Matches 263; Conservative 128; Mismatches 215; Indels 116; Gaps 12; 

Qy 222 VAKN I VARRRS A£> AAVI VYVNGGWS TWT EWS VC S AS CGRGWQKRS RS CTN PAPLNGGAFC 281 

: I 1111:111 II I : I I I : I I I : I I I I I I 
Db 1 MAANIVAKRRSLSATVWYVDGSWEVWSEWSVCSP 35 

Qy 282 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 341 

: I I I I I I : I I I I I I : I : I : : 
Db 36 ECEHLRI RECTAPPPRNGGKFCEGLSQES 64 



Qy 342 RNCTSDLCV HSASGPEDVALYVGLIAVAVCLVXLLLVLILVYCRKKEGL 390 

Mill: I hlllll | | | : | : : : : | | : 

Db 65 ENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAWAVAVLVIGVTLYRRSQSDY 123 

Qy 391 DSDVADSSILTSGFQPVSIKPSKADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKF 448 

I I I I I I I I I I : I : I II : I I I I I : I I I : I III I 
Db 124 GVDVIDSSALTGGFQTFNFKTVRQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKE 180 

Qy 449 QLTNGHLLSPLG GGRHTLHHSSPTSEA 475 

: I I : I I II: I : : 

Db 181 LMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPRNKM 240 



Qy 476 EEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLIPPDAI PRGKI YEI Y 535 

: : II III I III I I I I I :: I I I I : I I I I I III : I I I 

Db 241 -PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIY 293 



Qy 536 LTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQS 595 



Db 294 MSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLKKRT 352 

Qy 596 CEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFA 655 



Db 353 QQGKWEEVMSVEDESTS— CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFG 410 



Qy 656 PVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDV 715 

: : I I I : I I : I I I I : : I Mill I : I I I I : : I I : : I I I I : : I : : I : I : 
Db 411 CMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDI 470 

Qy 716 PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQS 775 

I I I : I : I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I I 
Db 471 PPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQI 530 



Qy 776 FSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRT 835 

: : I : I : : : II I I I I I I I : I I I : I : : I I : I I I : 



Db 531 LQVQT S I LES ERET IT FFAQEDST FPAQTGPKAFKI P YS I RQRI CAT FDT PNAKGKDWQM 590 

Qy 836 LAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSE 895 

I I I I : : : I I : I I : : I I : I : I I I I I I I I I : I : I III: : I : I : I I 
Db 591 LAQKNSINRNLSYFATQSSPSAVILNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISE 650 

Qy 896 AE 897 

Db 651 SQ 652 

RESULT 14 
US-09-969-532-28 

Sequence 28, Application US/09969532 
Patent No. 6777232 
GENERAL INFORMATION: 
APPLICANT: Walke, D. Wade 
APPLICANT: Scoville, John 

TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 
Encoding the Same 

FILE REFERENCE: LEX-0244-USA 
CURRENT APPLICATION NUMBER: US/09/969,532 
CURRENT FILING DATE: 2001-10-02 
PRIOR APPLICATION NUMBER: US 60/237,280 
PRIOR FILING DATE: 2000-10-02 
NUMBER OF SEQ ID NOS : 33 

SOFTWARE: FastSEQ for Windows Version 4.0 
SEQ ID NO 28 
LENGTH: 669 
TYPE: PRT 

ORGANISM: homo sapiens 
US-09-969-532-28 

Query Match 25.9%; Score 1242.5; DB 4; Length 669; 

Best Local Similarity 36.1%; Pred. No. 5.5e-109; 

Matches 262; Conservative 128; Mismatches 216; Indels 119; Gaps 12; 

Qy 222 VAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFC 281 

: I 1111:111 II I : I I I : I I I : I I I I I I 
Db . 1 MAANIVAKRRSLSATVVVYVDGSWEVWSEWSVCSP 35 

Qy 282 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 341 

: I I I III: I Mill: I : I : : 
Db 36 ECEHLRI RECTAP P PRNGGKFCEGLSQES 64 

Qy 342 RNCTSDLCVTISASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILT 401 

III II: I : I I I I I II I : I : : : : | | : I I I I I I I 

Db 65 ENCTDGLCI LGI ENAS DI ALYS GL- GAAWAVAVLVI GVTLYRRSQSDYGVDVI DS SALT 123 

Qy 402 SGFQPVSIKPSKA DNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPS 445 

I I I : I : III : I I I I I : II I : I I I I 

Db 124 GGFQTFNFKTVRQAKNIMELMIQEKSFGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PL 180 

Qy 446 PKFQLTNGHLLSPLG GGRHTLHHSSPT 472 

I : I I : I I I I : I 

Db 181 DKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPR 240 



Qy 473 SEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI Y 532 

:: : : I I I I I I III II I I I : : I I I I : I I I I I 111 : 

Db 241 NKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSW 293 

Qy 533 EIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLK 592 



Db 294 EIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCADVSSEHWNIHLK 352 

Qy 593 KQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLL 652 

I : : : I I I : I : : : I : I III: I I : I : I : I I I I : : I I : I I : 
Db 353 KRTQQGKWEEVMSVEDESTS — CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVA 410 

Qy 653 LFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSI 712 

: I : : I I I : I I : I I II : : I I : I I I I : I I I I : : I I : : I I I I : : I : : I : 
Db 411 VFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISV 470 

Qy 713 HDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGD 772 

I : I I I : I : I I : I I : I : : I I I I : I I I : I : I : I : I I : : I : : I 
Db 471 LDIPPFLWRIKPFTACQEVPFSRVWCSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGH 530 

Qy 773 GQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGAD 832 

I : : I : I : : : I I I I I I I I I : I I I : I : : I I : I I 

Db 531 EQILQVQTSILESERETITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKD 590 

Qy 833 WRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFT 892 

|:|||| : : : I I : I I : : I I : I : I I I I I I I I I : I : I III: : I : I 
Db 591 WQMLAQKNSINRNLSYFATQSSPSAVI LNLWEARHQHDGDLDSLACALEEIGRTHTKLSN 650 

Qy 893 VSEAE 897 

: I I : : 

Db 651 ISESQ 655 



RESULT 15 
US-09-969-532-26 

; Sequence 26, Application US/09969532 

; Patent No. 6777232 

; GENERAL INFORMATION: 

; APPLICANT: Walke, D. Wade 

; APPLICANT: Scoville, John 

; TITLE OF INVENTION: No. 6777232el Human Membrane Proteins and Polynucleotides 

Encoding the Same 

; FILE REFERENCE: LEX-0244-USA 

; CURRENT APPLICATION NUMBER: US/ 09/969 , 532 

; CURRENT FILING DATE: 2001-10-02 

; PRIOR APPLICATION NUMBER: US 60/237 , 280 

; PRIOR FILING DATE: 2000-10-02 

; NUMBER OF SEQ ID NOS : 33 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 26 

LENGTH: 680 

TYPE: PRT 
; ORGANISM: homo sapiens 
US-09-969-532-26 



Query Match 25.8%; Score 1234; DB 4; Length 680; 

Best Local Similarity 35.7%; Pred. No. 3.6e-108; 



Matches 263; Conservative 128; Mismatches 215; Indels 130; Gaps 13; 



Qy 222 VAKNIVARRRSASAAVT VYVNG 281 

: I 1111:111 II I : I I I : I I I : I I I I I I 
Db 1 MAANIVAKRRSLSATVWYVDGSWEVWSEWSVCSP 35 

Qy 282 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDT 341 

: I I I III: I Mill: I : I : : 
Db 36 ECEHLRI RECTAPPPRNGGKFCEGLSQES 64 

Qy 342 RNCTSDLCV HSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGL 390 

Mill: I I : I I I I I | | | : | : : : : | | : 

Db 65 ENCTDGLCILDKKPLHEIKPQSIENASDIALYSGL-GAAWAVAVLVIGVTLYRRSQSDY 123 

Qy 391 DSDVADSSILTSGFQPVSIKPSKA DNPHLL — TIQPDLSTTTTTYQ 434 

I I I I I I I I I I : I : III : I I I I I : I I 

Db 124 GVDVI D S SALT GG FQT FN FKT VRQAKN I ME LMI Q E K S FGN S LL LN S AMQ P DL- TVS RT Y S 182 

Qy 435 GSLCPRQDGPSPKFQLTNGHLLSPLG G 461 

I : I I I I I : I I : I I I 
Db 183 GPIC-LQD-PLDKELMTESSLFNPLSDIKVKVQSSFMVSLGVSERAEYHGKNHSRTFPHG 240 

Qy 462 GRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLI 521 

I : I : : : : II III I III I M I I : : I M I : M I I 

Db 241 NNHSFSTMHPRNKM-PYIQNLS SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLI 293 

Qy 522 PPDAIPRGKIYEIYLTLHKPEDWLPIAGCQTLLSPIVSCGPPGVLLTRPVIIAMDHCGE 581 

I III : || | : : : : : | | | : | | I I I : M I I : : : I I I : M : 

Db 294 PHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSPEVTCGPPDMIVTTPFALTIPHCAD 352 

Qy 582 PSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEAL 641 

I : I : : I M : : : I I I : I : : : I : I II I : MM : I : I I I I : 
Db 353 VSSEHWNIHLKKRTQQGKWEEVMSVEDESTS — CYCLLDPFACHVLLDSFGTYALTGEPI 410 

Qy 642 SVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHF 701 

: I I : I I : : I : : I II : I I : I I II : : I I MM I : II I I : M I : M I I 
Db 411 TDCAVKQLKVAVFGCMSCNSLDYNLRVYCVDNTPCAFQEWSDERHQGGQLLEEPKLLHF 470 

Qy 702 KDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLA 761 

I : : I : : I : I : I II : I : *l I M I M : : III I M II MM: I : 
Db 471 KGNT FS LQI SVLDI P P FLWRI KP FTACQEVP FS RVWCSNRQPLHCAFS LERYT PTTTQLS 530 

Qy 762 CKLWVWQVEGDGQS FS IN FNI TKDTRFAELLALESEAGVPALVGPSAFKI P FLI RQKI I S 821 

I I : : I : M I : M : I : : : II I I I I I II : I I I M : 

Db 531 CKI CI RQLKGHEQI LQVQTS I LES ERETI T FFAQEDST FPAQTGPKAFKI P YS I RQRI CA 590 

Qy 822 SLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVA 881 

: I I M I I : II I I :: M I : M : : I I M M I I I I I II I MM II I : 
Db 591 T FDT PNAKGKDWQMLAQKNS INRNLS YFATQS S PSAVI LNLWEARHQHDGDLDS LACALE 650 

Qy 882 GLGQPDAGLFTVSEAE 897 

M: I Ml:: 
Db 651 EIGRTHTKLSNISESQ 666 



Search completed: March 1, 2005, 09:05:46 
Job time : 57.8066 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



March 1, 2005, 08:46:18 



Search time 35.0558 Seconds 

(without alignments) 

2464.715 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-624-932-2 
4791 

1 MAVRPGLWPALLGI VLAAWL . 



. AVAGLGQ P DAGL ETVS EAEC 898 



Scoring table: BLOSUM62 

Gapop 10.0 , Gapext 0.5 

Searched: 283416 seqs, 96216763 residues 

Total number of hits satisfying chosen parameters: 283416 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : PIRJ79:* 
1: pirl:* 
2: pir2:* 
3: pir3:* 
4: pir4:* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 
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ALIGNMENTS 



RESULT 1 
T32541 

unc-5 protein - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T32541 
R;Latreille, P. 

submitted to the EMBL Data Library, December 1997 

A; Description: The sequence of C. elegans cosmid B0273. 

A; Reference number: Z21187 

A;Accession: T32541 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-919 <LAT> 

A; Cross-references: UNI PROT : 044171 ; EMBL : AF036698 ; PIDN : AAB88355 . 1 ; 
GSPDB:GN00022; CESP : B0273 . 4a 

A; Experimental source: strain Bristol N2; clone B0273 
C; Genetics : 

A;Gene: unc-5; CESP : B0273 . 4a 
A; Map position: 4 



A;Introns: 41/3; 108/1; 142/3; 201/1; 323/2; 553/1; 858/3 
C;Superfamily: unc-5 protein; immunoglobulin homology; SH3 homology; 
thrombospondin type 1 repeat homology 

Query Match 20.4%; Score 977; DB 2; Length 919; 

Best Local Similarity 28.7%; Pred. No. 2.3e-62; 

Matches 265; Conservative 168; Mismatches 379; Indels 110; Gaps 31; 

Qy 49 EPEDv^IVTtNKPVXLVCKAVPAT^ — STDGSSGLPTMEVRI 106 

: I : I : : : I I I : I I : I I I : I : I I : : I : I I I : I I : I I : :: : 
Db 9 QPKSGYVIRNKPLRLQCRANHATKIRYKCSSKWID — DSRIEKLIGTDSTSGVGYIDASV 66 

Qy 107 NVSRQQVEKVFGLEEYWCQCVAWSSSG TTKSQKAYIRIARLRKNFEQEPLAKEVS 161 

: : I I I : : : : I I I I II I . I : : I : I I : I : I .: I : I 

Db 67 DI S RI DVDT S GHVDAFQCQC YA SGDDDQDWAS DVATVHLAYMRKHFLKS PVAQRVQ 123 

Qy 162 LEQGIVXPCRPPEGIPPAE\^LRNEDLVT)PSLDPNVYIT^ 221 

: I I I : I I I II : I : : : I I III : I I : : I I I : I : I I I I 
Db 124 EGTTLQLPCQAPESDPKAELTWYKDGVWQP — DANVTRASDGSLIMSAARLSDSGNYTC 181 

Qy 222 VAKNIVARRRSASAAVIVYVNGGWSTWTEW-SVCSASCG RGWQKR 265 

II: I : : I : I I : I I I I I : I I I I : I 

Db 182 EATNVANSRKTDPVWQIYVT5GGWSEWSPWIGTCHVDCPLLRQHAHRIRDPHDVLPHQRR 241 

Qy .266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325 

: I : I I I I I I I I : I : I : : I I : I I I I I I I I I I I : I : I I : I 

Db 242 TRTCNNPAPLNDGEYCKGEEEMTRSCKVPCKLDGGWSSWSDWSACSSSCHRYRTRACTVP 301 

Qy 326 APRNGGEECQGTDLDTRNCTSDLCVHSASG — PEDVALYVGLIAVAVCLVLLLLVLILVY 383 

I I I I : I I I I I : I : I I :| I hi : :: : :| :| : 

Db 302 PPMNGGQPCFGDDLMTQECPAQLCTADS SRIVI SDTAVYGSVAS I FIVAS FI LAILAMFC 361 

Qy 384 CR KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTI 422 

I . I ... I . | . . I . . I 

Db 362 CKRGNSKKSKPLKPQKMNSEKAGGIYYS EPPGVRRLLLEHQHGTLLGEKISSCSQYF 418 

Qy 423 -QPDLSTTTT TYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPT-SE 474 

I I : I I : I II I I I I I I I I : 

Db 419 EPPPLPHSTTLRSGKSAFSGYSSTRNAGSRAALIQECSSSSSGSGGKRTMLRTSSSNCSD 478 

Qy 475 AEEFVSRLSTQNYFRSLPRGTS-NMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYE 533 

: : : I III: : I I I : : I I : : I I : I : 

Db 479 DDNYATLYDYMEDKSVXGLDTSQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM — 536 

Qy 534 IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV LLTRPVILAMDHCGEPSP-D 585 

- : I I : : I : : I I I :: I I : I I I I :: : - I I II 

Db 537 LYLAVSDTLTDQPHLKPI ESALS PVIVI GQ C DVSMS AH DN I LRRPVWS FRHCASTFPRD 596 

Qy 586 SWSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASA CYVFTEQLGRFAL 636 

: I I : | | I I : : : I I I : : I I I : I I I I I 

Db 597 NWQFTL--YADEGSGWQKAVTIGEENLNTNMFVQFEQPGK 654 

Qy 637 VGEAL — SVAAAKRLKLLLFAPVACTSLE — YNIRVYCLHDTHDALKEWQLEKQLGGQL 692 

I | :: | | | | : | : |. | : : : : | | | | : : | | : : | : | | : | 

Db 655 AGHPRRNSLSAAKRVHLAVFGPTEMSAYRRPFELRVYCVPETGAAMESVWKQED— GSRL 712 



Qy 



693 IQEPR— VLHFKDSYHNLRLSIHDV-PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 749 



Db 


713 


LCESNDFILNEKG NLCICIEDVIPGFSCDGPEWEISETQHREV AQNGLHCSLK 


766 


Qy 


/ jU 


L ERVS P S T S D LAC KLWVWQVEG DGQ S FS 1 NFNITKDTKFAELLALESEAGVPALVGPSAF 
• ii •••i*t • •*.«•• » * l l 1 * i 


o09 


Db 


767 


• 1 •••!•! • • ■ • • ••111 • 1 

FRPKEINGSQFSTRVIVYQKASSTEPMVM — EVSNEPELYDATSEEREKGSVCV EF 


820 


Qy 


810 


KI PFLI RQKI I S SLDPPCRRGADWRTLAQKLHLDSHLSFFASKP — S PTAMI LNLWEARH 


867 






.ii iii i i i ■ i i ■ i i iiiiii iii * 
: : 1 1 : : : : II 1 : 1 1 1 1 1 : 1 1 1 1 - 1 1 1 1 1 1 1 1 1 : : : 1 : 1 1 1 1 




Db 


821 


RLPFGVKDELARLLDMPNESHSDWRGLAKKLHYDRYLQFFASFPDCSPTSLLLDLWEASS 


880 


Qy 


868 


FPNGN-LSQLAAAVAGLGQPDA 888 
: : 1 : : 1 : 1 1 1 




Db 


881 


SGSARAVPDLLQTLRVMGRPDA 902 





RESULT 2 
B44294 

unc-5 protein, long form - Caenorhabditis elegans 
N; Contains: unc-5 protein, short form 
C; Species: Caenorhabditis elegans 

C;Date: 30-Apr-1993 #sequence_revision 28-Jul-1995 #text_change 09-Jul-2004 
C;Accession: B44294; T32540; A44294 

R;Leung-Hagesteijn, C; Spence, A.M.; Stern, B.D.; Zhou, Y. ; Su, M.W. ; 
Hedgecock, E.M.; Culotti, J.G. 
Cell 71, 289-299, 1992 

A; Title: UNC-5, a transmembrane protein with immunoglobulin and thrombospondin 

type 1 domains, guides cell and pioneer axon migrations in C. elegans. 

A; Reference number: A44294; MUID: 93046629 ; PMID: 1384987 

A; Contents: variety Bergerac 

A; Accession: B44294 

A; Molecule type: DNA 

A; Residues: 1-947 <LEU> 

A; Cross-references: UNI PROT : 044171; GB:S47168; NID:g258527; PIDN*: AAB2 38 67 . 1 ; 
PID:g258529 

A;Note: sequence extracted from NCBI backbone (NCBIN: 116668, NCBIN : 116670, 
NCBIN:116672, NCBIN : 116674 , NCBIN : 116676, NCBIN : 116678, NCBIN : 116680, 
NCBIN:116682, NCBIN : 116685, NCBIP : 118648) 

A; Note: authors translated the codon CTA for residue 642 as Val; sequence shown 
follows the authors 1 translation 

A; Note: mRNA lacking the first exon is equally prevalent 
R;Latreille, P. 

submitted to the EMBL Data Library, December 1997 

A; Description: * The sequence of C. elegans cosmid B0273. 

A; Reference number: Z21187 

A; Accession: T32540 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 
A; Residues: 1-947 <LAT> 

A; Cross-references: EMBL : AF036698 ; PIDN : AAB88356 . 1 ; GSPDB : GN00022 ; CESP : B0273 . 4b 

A; Experimental source: strain Bristol N2; clone B0273 

C; Genetics : 

A; Gene: unc-5 

A; Map position: 4 

A;Introns: 28/1; 69/3; 136/1; 170/3; 229/1; 351/2; 581/1; 886/3 
C; Function: 



A; Description: required for guidance of pioneering axons and cells migrating 
dorsally along the body wall; proposed to be a receptor on the surface of the 
motile cells 

C; Super family: unc-5 protein; immunoglobulin homology; SH3 homology; 
thrombospondin type 1 repeat homology 

C; Keywords: alternative splicing; duplication; glycoprotein; receptor; 
transmembrane protein 

F; 30-947/Product : unc-5 protein, short form #status predicted <ALT> 
F;46-116/Domain: immunoglobulin homology <IM1> 
F; 153-211/Domain: immunoglobulin homology <IM2> 

F;229-300/Domain: thrombospondin type 1 repeat homology #status atypical <THR1> 
F;301-354/Domain: thrombospondin type 1 repeat homology <THR2> 
F;365-390/Domain: transmembrane #status predicted <TMM> 
F;512-559/Domain: SH3 homology <SH3> 

F; 53-114, 65-112, 160-209/Disulf ide bonds: #status predicted 
F;206/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 20.4%; Score 977; DB 1; Length 947; 

Best Local Similarity 28.7%; Pred. No. 2.4e-62; 

Matches 265; Conservative 168; Mismatches 379; Indels 110; Gaps 31; 

Qy 49 EPEDVTIvTCNKPVXLVCKAVPATQIFFKCN^ — STDGSSGLPTMEVRI 106 

: I : I : : : I I I : I I : I I I : I : I I : : I : I I I : I I : I I : : : : 
Db 37 QPKSGYVIRNKPLRLQCRANHATKIRYKCSSKWID— DSRIEKLIGTDSTSGVGYIDASV 94 

Qy 107 NVSRQQVEKVFGLEEYWCQCVAWSSSG TTKSQKAYI RI ARLRKNFEQEPLAKEVS 161 

::|| I: :: : III I II I I : : I : I I : I : I : I : I 

Db 95 DI S RI DVDT S GHVDAFQCQC YA S GDD DQ DWAS D VAT VHLAYMRKH FLKS PVAQ RVQ 151 

Qy 162 LEQGIVXPCRPPEGIPPAEVT5WLRNEDLvT)PSLDPNW 221 

: I I I : I I I I I : I : : : I I III : I I :: I I I : I : I I I I 
Db 152 EGTTLQLPCQAPESDPKAELTWYKDGVWQP — DANVI RASDGSLIMSAARLSDSGNYTC 209 

Qy 222 VAKNIVARRRSASAAVIVYVNGGWSTWTEW-SVCSASCG RGWQKR 265 

II: I:: I : I I : I I I I I : I I I |:| 
Db 210 EATNVANSRKTDPVEVQIYVDGGWSEWSPWIGTCHVDCPLLRQHAHRIRDPHDVLPHQRR 269 

Qy 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325 

: I : 1 I I I I II I : I : I : : I I : I I I I I I I I I I I : I : I I : I 

Db 270 TRTCNNPAPLNDGEYCKGEEEMTRSCKVPCKLDGGWSSWSDWSACSSSCHRYRTRACTVP 329 

Qy 326 APRNGGEECQGTDLDTRNCTSDLCVHSASG — P EDVAL YVGL I AVAVC LVLLL LVL I LVY 383 

I I I I: I I I I I : I : I I : I I hi : :: : : I : I : 

Db 330 PPMNGGQPCFGDDLMTQECPAQLCTADS SRIVI SDTAVYGSVAS I FI VAS FILAILAMFC 389 

Qy 384 CR KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTI 422 

I : | : : : | : | : : | : : : : | : 
Db 390 CKRGNSKKSKPLKPQKMNSEKAGGIYYS EPPGVRRLLLEHQHGTLLGEKISSCSQYF 446 

Qy 423 -QPDLSTTTT TYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPT-SE 474 

I I : I I : I II I I I I I I I I : 

Db 447 EPPPLPHSTTLRSGKSAFSGYSSTRNAGSRAALIQECSSSSSGSGGKRTMLRTSSSNCSD 506 

Qy 475 AEEFVSRLSTQNYFRSLPRGTS-NMTYGTFNFLGGRLMI PNTGISLLI PPDAI PRGKI YE 533 

: : : I III: : I I I : : I I : : I I : I : 

Db 507 DDNYATLYDYMEDKSVLGLDTSQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM— 564 



Qy 534 IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV LLTRPVI LAMDHCGEPSP-D 585 

-: I I : : I : : I I I : : I I : I I I I : : : II II 

Db 565 LYLAVSDTLTDQPHLKPIESALSPVIVTGQCDVSMSAHDNILRRPVWS FRHCASTFPRD 624 

Qy 586 SWSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASA CYVFTEQLGRFAL 636 

: I I : I I I I : : : I I I : : I I I : I I I I I 

Db 625 NWQFTL — YADEGSGWQKAWIGEENLNTNMFVQFEQPGKKNDGFGWCHVTOT 682 

Qy 637 VGEAL — SVAAAKRLKLLLFAPVACTSLE — YNIRVYCLHDTHDALKEWQLEKQLGGQL 692 

I I : : I I I I : I : I I : : : : I I I I : : I I : : I : I I : I 

Db 683 AGHPRRNSLSAAKRVHIAVFGPTEMSAYRRPFELRVYCVPETGAAMESVWKQED — GSRL 740 

Qy 693 IQEPR — VLHFKDSYHNLRLSIHDV-PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 749 

: I : I : I I I : I I I I : I I : I III: 
Db 741 LCESNDFI LNEKG NLCICIEDVIPGFSCDGPEWEISETQHRFV AQNGLHCSLK 794 

Qy 750 LERVSPSTSDLACKLWWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAF 809 

: I : : : I : I : : : : : : : I I I : I 
Db 795 FRPKEINGSQFSTRVIVYQKASSTEPMVM — EVSNEPELYDATSEEREKGSVCV EF 84 8 

Qy 810 KI PFLI RQKI I S S LDPPCRRGADWRTLAQKLHLDSHLS FFAS KP — S PTAMI LNLWEARH 867 

: : I I : : : : III : I I I I I : I I I I : I I I I I I I I I ::: I : I I I I 
Db 849 RLPFGVKDELARLLDMPNESHSDWRGLAKKLHYDRYLQFFASFPDCSPTSLLLDLWEASS 908 

Qy 868 FPNGN-LSQLAAAVAGLGQPDA 888 

: : I : : I : I I I 
Db 909 SGSARAVPDLLQTLRVMGRPDA 930 



RESULT 3 
T00026 

brain-specific angiogenesis inhibitor 1 - human 
N;Alternate names: BAI1 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C; Accession: TO 002 6 

R;Nishimori, H. ; Shiratsuchi, T.; Urano, T.; Kimura, Y. ; Kiyono, K. ; Tatsumi, 
K. ; Yoshida, S.; Ono, M. ; Kuwano, M. ; Nakamura, Y. 
submitted to the EMBL Data Library, June 1997 
A; Reference number: Z 14 064 
A; Accession: T00026 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-1584 <NIS> 

A; Cross-references: UNIPROT :014514 ; EMBL: ABO 052 97 ; NID: dll75078 ; PID:dl024528 

A; Experimental source: brain 

C; Genetics: 

A; Gene: GDB : BAI 1 

A;Cross-references: GDB: 9838088; OMIM: 602682 
A; Map position: 8q24-8q24 

F; 4 08-4 62 /Domain : thrombospondin type 1 repeat homology <THR3> 

Query Match 6.2%; Score 298.5; DB 2; Length 1584; 

Best Local Similarity 33.5%; Pred. No. 5.2e-13; 

Matches 78; Conservative 35; Mismatches 91; Indels 29; Gaps 11; 



Qy 124 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEW 183 



I I : I I I : : I I : : I I I : I I I I I I I I I 
Db 309 CNREACGPAGRTSSRSQSLRSTDARR REELGDEL QQFGFPA-P.QTGDPAAE-EW 360 

Qy 184 LRNEDLVDPSLDPNVYITREHSLWRQ7VRIADTANYTCVAKNIVARRRSASAAVIVYVNG 243 

: II : I I I :::|: : :| : : : |:| 

Db 361 —SPWSVCSSTCGEGWQTR TRFCVS S S YSTQCSGPLREQRLCNNSAVCPVHG 410 

Qy 244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCP VDG 299 

I I : I I : I I : : I I I I : : I : I : I | | | III I | Ml III 
Db 411 AWDEWSPWSLCSSTCGRGFRDRTRTCR — PPQFGGNPCEGPEKQTKFCNIALCPGRAVDG 468 

Qy 300 SWSPWSKWSACGLDCT HWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLC 349 

: I : I I I I I I I : I : I I I : I : I I I I I I : : I I : I I 

Db 469 NWNEWS SWSACSASCSQGRQQRTRECNGPS — YGGAECQGHWVET RDC FLQQC 519 



RESULT 4 
TSHUP2 

thrombospondin 2 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 19-May-1995 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004 
C;Accession: A47379; A42173 
R;LaBell, T.L.; Byers, P.H. 
Genomics 17, 225-229, 1993 

A; Title: Sequence and characterization of the complete human thrombospondin 2 

cDNA: potential regulatory role for the 3 1 untranslated region. 

A; Reference number: A47379; MUID: 94010892 ; PMID: 8406456 

A;Accession: A47379 

A;Molecule type: mRNA 

A; Residues: 1-1172 <LAB> 

A; Cross-references: UNIPROT : P35442 ; GB:L12350; NID:g307505; PIDN : AAA03703 . 1; 
PID:g307506 

R; LaBell, T.L.; Milewicz, D.J.; Disteche, CM. ; Byers, P.H. 
Genomics 12, 421-429, 1992 

A; Title: Thrombospondin II: partial cDNA sequence, chromosome location, and 

expression of a second member of the thrombospondin gene family in humans. 

A; Reference number: A42173; MUID : 92217961; PMID: 1559694 

A;Accession: A42173 

A;Molecule type: mRNA 

A; Residues: 560-1172 <LA2> 

A; Cross-references : GB:M81339 

A; Experimental source: fibroblast 

A;Note: sequence extracted from NCBI backbone (NCBIN : 95091, NCBIP: 95096) 
C; Genetics: 

A; Gene: GDB:THBS2; TSP2 

A; Cross-references : GDB: 128789; OMIM: 188061 . .. 

A;Map position: 6q27-6q27 

C; Complex: homotrimer, disulfide linked 

C; Function: 

A; Description: participates in cell migration and adhesion, and in platelet 
aggregation 

C; Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C; Keywords: beta-hydroxyasparagine; calcium binding; cell adhesion; 
glycoprotein; t rimer 

F; 1-18/Domain: signal sequence #status predicted <SIG> 
F;19-1172/Product: thrombospondin 2 #status predicted <MAT> 



F;319-377/Domain: von Willebrand factor type C repeat homology <VWC> 
F; 38 0-4 3 1/Domain :. thrombospondin type 1 repeat homology <THR1> 
F;436-492/Domain: thrombospondin type 1 repeat homology <THR2> 
F; 493-549/Domain: thrombospondin type 1 repeat homology <THR3> 
F;553-588/Domain: EGF homology <EGF1> 
F; 652-691/Domain: EGF homology <EGF> 
F;928-930/Region: cell attachment (R-G-D) motif 

F; 151, 316, 330, 457,584, 710, 1069/Binding site: carbohydrate (Asn) (covalent) 
#status predicted 

F; 167-226/Disulf ide bonds: #status predicted 

F;266, 270/Disulfide bonds: interchain #status predicted 

F;612/Modified site: erythro-beta-hydroxyasparagine (Asn) #status predicted 

Query Match 6.2%; Score 296.5; DB 1; Length 1172; 

Best Local Similarity 30.5%; Pred. No. 4.9e-13; 

Matches 78; Conservative 28; Mismatches 105; Indels 45; Gaps 9; 

Qy 209 RQARLADTAN YT CVAKN I VARRRS AS AA- VI VYVNGGWS TWT EWS VC S AS CGRGWQKRS R 267 

: : I I : I I : : I I II : : I I I I I : I I I I : I I I II 

Db 403 QRGRSCDVTSNTCLGPSIQTRACSLSKCDTRIRQDGGWSHWSPWSSCSVTCGVGNITRIR 462 

Qy 268 SCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDCT HWRSRECS 323 

I : I I II I : I : I I I I : I I I I I I I I I I I : I I : I I : 

Db 463 LCNS PVPQMGGKNCKGSGRETKACQGAPCPI DGRWS PWS PWSACT VTCAGGI RERTRVCN 522 

Qy 324 DPAPRNGGEECQGTDLDTRNCTSDLCVTiSASGPEDVALWGLIAVAVCLVLLLLVLILVY 383 

I I : I I : I I : : I I III II 
Db 523 SPEPQYGGKACVGDVQERQMCNKRSC PVDGCLSNPCFPGAQC 564 

Qy 384 CRKKEGLDSDVADSSILTSGFQPVSI — KPSKADNPHLLTIQPDLSTTTT TYQ 434 

I I I : II II : :: : ||: :|: I 

Db 565 SSFPDGS-WSCGFCPVGFLGNGTHCEDLDECALVPDICFSTSKVPRCVNTQP 615 

Qy 435 GSLC PRQDGPSP 446 

I I I I I I 

Db 616 GFHCLPCPPRYRGNQP 631 



RESULT 5 
JC5928 

semaphorin F precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 10-Apr-1998 #sequence_revision 08-May-1998 #text_change 09-Jul-2004 
C; Accession: JC5928 

R; Simmons, A.D.; Pueschel, A.W.; McPherson, J.D.; Overhauser, J.; Lovett, M. 
Biochem. Biophys . Res. Commun. 242, 685-691, 1998 

A; Title: Molecular cloning and mapping of human semaphorin F from the Cri-du- 
chat candidate interval. 

A; Reference number: JC5928; MUID: 98125554 ; PMID: 9464278 
A; Accession: JC5928 

A; Status: nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-1074 <SIM> 

A; Cross-references: UNIPROT:Q13591; GB:U52840; NID: g2772583; PIDN : AAC09473 . 1 ; 
PID:g2772584 

A; Experimental source: brain 



C; Comment: This protein disrupts normal brain development and leads to some of 
the features of Cri-du-chat. 
C; Genetics : 
A; Gene: semaf 

C; Super family: human semaphorin F; thrombospondin type 1 repeat homology 

F; 1-20/Domain: signal sequence #status predicted <SIG> 

F; 50-533/Domain: semaphorin #status predicted <SEM> 

F; 8 4 0-8 9 6/ Domain : thrombospondin type 1 repeat homology <THR3> 

F; 971-993/Domain: transmembrane #status predicted <TMM> 

Query Match 6.1%; Score 293; DB 2; Length 1074; 

Best Local Similarity 45.8%; Pred. No. 7.8e-13; 

Matches 54; Conservative 11; Mismatches 49; Indels 4; Gaps 2; 

Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATL-CPVDG 299 

I I I I I I I I I I 1 111:111111 II II::: I I I I I I II 
Db 783 VNGAWSAWTSWSQCSRDCSRGIRNRKRVCNNPEPKYGGMPCLGPSLEYQECNTLPCPVDG 842 

Qy 300 SWSPWSKWSACGLDC THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSAS 354 

I I I I I : I I : I : I I I : I I I I I : I I : I : I II 
Db 843 VWSCWSPWTKCSATCGGGHYMRTRSCSNPAPAYGGDICLGLHTEEALCNTQPCPESWS 900 

RESULT 6 
A42587 

thrombospondin 2 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Mar-1993 #sequence__revision 18-Nov-1994 #text_change 09-Jul-2004 
C;Accession: A42587; A39851 

R;Laherty, CD. ; O'Rourke, K.; Wolf, F.W.; Katz, R. ; Seldin, M.F.; Dixit, V.M. 
J. Biol. Chem. 267, 3274-3281, 1992 

A; Title: Characterization of mouse thrombospondin 2 sequence and expression 
during cell growth and development. 

A; Reference number: A42587; MUID : 92147683; PMID: 1371115 
A;Accession: A42587 

A; Status: preliminary; not compared with conceptual translation 
A;Molecule type: nucleic acid 
A; Residues: 1-1172 <LAH> 

A; Cross-references: UNIPROT : Q03350 ; GB:L07803; GB:M87275; NID:g340421; 
PIDN:AAA53064.1; PID:g567241 

A;Note: sequence extracted from NCBI backbone (NCBIP : 81502 ) 

R;Bornstein, P.; O'Rourke, K.; Wikstrom, K. ; Wolf, F.W. ; Katz, R. ; Li, P.; 
Dixit, V.M. 

J. Biol. Chem. 266, 12821-12824, 1991 

A; Title: A second, expressed thrombospondin gene (Thbs2) exists in the mouse 
genome. 

A; Reference number: A39851; MUID: 91302287; PMID: 1712771 
A; Access ion: A39851 
A; Status: preliminary 
A; Molecule type: mRNA 
A; Residues: 1-873 <BOR> 

A; Cross-references: GB:M64866; NID:g201994; PIDN : AAA40432 . 1 ; PID:g201995 
C; Superf amily : thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C;Keywords: calcium binding; glycoprotein 

F;319-377/Domain: von Willebrand factor type C repeat homology <VWC> 
F; 38 0-4 31/ Domain : thrombospondin type 1 repeat homology <THR1> 



F;436-492/Domain: thrombospondin type 1 repeat homology <THR2> 

F;493-549/Domain: thrombospondin type 1 repeat homology <THR3> 

F; 553-58 8/Domain: EGF homology <EGF1> x 

F;652-691/Domain: EGF homology <EGF> 

Query Match 6.1%; Score 293; DB 2; Length 1172; 

Best Local Similarity 38.0%; Pred. No. 8.7e-13; 

Matches 60; Conservative 22; Mismatches 66; Indels 10; Gaps 5; 

Qy 209 RQARLADTANYTCVAKNIVARRRS-ASAAVIVYW 267 

: : I I : I I : : I I I : I I I I I I : I I I I : I I I II 

Db 403 QRGRSCDVTSNTCLGPSIQTRTCSLGKCDTRI RQNGGWSHWS PWS SCS VTCGVGNVTRI R 462 



Qy 268 SCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDCT— HWRSRECS 323 

I : I I II I : I : I I I : I I I I I I I I I I I : I INI: 
Db 463 LCNSPVPQMGGKNCKGSGRETKPCQRDPCPIDGRWSPWSPWSACTVTCAGGIRERSRVCN 522 

Qy 324 DPAPRNGGEECQG — TD LDTRNCTSDLCVHSASGP 356 

I I : I I : : I I I : : I : \ I I : : I 
Db 523 SPEPQYGGKDCVGDVTEHQMCNKRSCPIDGCLSNPCFP 560 



RESULT 7 
T18856 

angiogenesis inhibitor homolog - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T18856; T24653 
R;McMurray, A. 

submitted to the EMBL Data Library, July 1995 
A; Reference number: Z19031 
A; Accession: T18856 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-1444 <WIL> 

A;Cross-references: UNIPROT : Q8MYA8 ; EMBL:Z50004; PIDN : CAA90293 . 1 ; GSPDB : GN00028 ; 
CESP:C02B4.1 

A; Experimental source: clone C02B4 
R;McMurray, A. 

submitted to the EMBL Data Library, July 1995 
A; Reference number: Z19917 
A;Accession: T24653 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A;Molecule type: DNA 

A; Residues: 1-1444 <WI2> 

A; Cross-references: EMBL:Z50006; PIDN : CAA90302 . 1; GSPDB : GN00028 ; CESP:C02B4.1 

A; Experimental source: clone T07C5 

C; Genetics : 

A; Gene : CESP : C02B4 . 1 

A;Map position: X 

A;Introns: 25/3; 70/3; 96/3; 139/3; 187/1; 234/2; 282/3; 376/2; 422/2; 478/3; 
509/3; 566/2; 625/1; 696/2; 786/3; 812/2; 878/3; 971/1; 1007/3; 1067/1; 1099/3; 
1180/3; 1273/2; 1305/1; 1363/1; 1388/2 



Query Match 5.8%; Score 276; DB 2; Length 1444; 

Best Local Similarity 27.2%; Pred. No. 2e-ll; 

Matches 73; Conservative 28; Mismatches 97; Indels 70; Gaps 12; 



Qy 

Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 



123 WCQCVAWSSSGTTKSQKAYIRIARLRKNFEQ 



1134 WSEWSSWSAC- 



EPLAKEVSLEQGIVLPCRPPEGI 176 

: : I : I I : III : I I I 
SCFSLTSTRRRFCQWDPTVQGFCAGAILEQ IPCAPGSCS 1183 



177 PPAE VEW LRNEDLVDPSLDPNVYITREHSLWRQARLADTAN 218 

II II : | | : I : | | : 

1184 PSAGGWSLWSEWSSCSKDCGDTGHQIRNRMCSEP IPSNRGAYCSG 1228 

219 YT CVAKN I VARRRS AS AAVI VYVNGGWSTWT EWS VC S AS CGRGWQKRS RS CTN PA 273 

I : II I : : : I : I I I : I I I I I : I I : I : I I I I 

1229 YS FDQRPCVMDNVCS DEK VDGGWTDWTAWSECTDYCRNGHRSRTRFCANPK 1279 

274 PLNGGAFCEGQNVQKTAC — ATLCPV-DGSWSPWSKWSACGLDC THWRSRECSDPAP 327 

I I I I I I : : I I : I I I I I I I : I I I I I I I I 

1280 PSQGGAQCTGSDFELNPCFDPARCHLRDGGWSTWSDWTPCSASCGFGVQTRDRSCSSPEP 1339 

328 RNGGEECQGTDLDTRNCTSDLCVHSASG 355 

: I I : I I I I I I : I 

1340 K-GGQSCSGLAHQTSLCDLPACDHESDG 1366 



RESULT 8 
T00326 

hypothetical protein KIAA0550 - human 
C; Species: Homo sapiens (man) 

C;Date: Ol-Feb-1999 #sequence_revision Ol-Feb-1999 #text_change 15-Mar-2004 
C; Accession: T00326 

R;Nagase, T.; Ishikawa, K. ; Miyajima, N.; Tanaka, A.; Kotani, H. ; Nomura, N. ; 
Ohara, O. 

DNA Res. 5, 31-39, 1998 

A; Title: Prediction of the coding sequences of unidentified human genes. IX. The 
complete sequences of 100 new cDNA clones from brain which can code for large 
proteins in vitro. 

A; Reference number: Z14086; MUID : 98290545 ; PMID: 9628581 
A; Accession: T00326 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-984 <NAG> 

A; Cross-references: EMBL : AB011122 ; NID:g3043623; PIDN : BAA25476 . 1 ; PID:g3043624 

A; Experimental source: brain 

C; Genetics : 

A; Note: KIAA0550 

F; 34 4-3 98 /Domain : thrombospondin type 1 repeat homology <THR3> 



Query Match 5.7%; Score 275; DB 2; Length 984; 

Best Local Similarity 39.0%; Pred. No. 1.4e-ll; 

Matches 57; Conservative 20; Mismatches 53; Indels 



16; Gaps 



Qy 

Db 



220 



TCVA —KNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 

III: : I : : I : I I I : I I : I I : I I I I : I : I I I I 

317 TCVSPYGTHCSGPLRESRVCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCT— P 



6; 



273 



374 



Qy 

Db 



274 PLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPRN 329 

I II III I I I I I I I I I I I I I : I I I I I : I : I : 

375 PQYGGRPCEGPETHHKPCNIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCT — AAAH 432 



Qy 330 GGEECQGTDLDTRNCTSDLCVHSASG 355 

I I I I : I : : I I : I : i : I 
Db 433 GGSECRGPWAESRECYNPEC — TANG 456 



Y.; Tokino, T. 
novel genes homologous 



RESULT 9 
T00028 

brain-specific angiogenesis inhibitor 3 - human 
N; Alternate names: BAI3 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C; Accession: T00028 

R; Shiratsuchi, T.; Nishimori, H.; Ichise, H.; Nakamura, 
Cytogenet. Cell Genet. 79, 103-108, 1997 
A;Title: Cloning and characterization of BA12 and BAI3, 
to brain-specific angiogenesis inhibitor 1 (BAI 1) - 
A; Reference number: Z14066; MUID : 98194217 ; PMID: 9533023 
A;Accession: T00028 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-1522 <SHI> 

A; Cross-references: UNIPROT : 060242 ; EMBL: AB005299; NID : g3021700; 
PIDN:BAA25363.1; PID:g3021701 
A; Experimental source: brain 
C; Genetics : 
A; Gene: GDB : BAI 3 

A; Cross-references: GDB: 9838090; OMIM:602684 
A;Map position: 6ql2-6ql2 

F; 344-398/Domain: thrombospondin type 1 repeat homology <THR3> 

Query Match 5.7%; Score 275; DB 2; Length 1522; 

Best Local Similarity 39.0%; Pred. No. 2.5e-ll; 

Matches 57; Conservative 20; Mismatches 53; Indels 16; Gaps 



6; 



Qy 



Db 



220 TCVA KNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 273 

III: : I : : hi I I : I I : I I : I I I I . : I : I I I I 

317 TCVSPYGTHCSGPLRESRVCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCT — P 374 



Qy 

Db 

Qy 

Db 



274 PLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPRN 329 

I I I I I I I I I I I I I I I I I I I : I I I I I : I : I : 

375 PQYGGRPCEGPETHHKPCNIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCT — AAAH 432 

330 GGEECQGTDLDTRNCTSDLCVHSASG 355 

I I I I : I :: I I : I : I : I 
433 GGSECRGPWAESRECYNPEC — TANG 456 



RESULT 10 
T00027 

brain-specific angiogenesis inhibitor 2 - human 
N;Alternate names: BAI2 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C;Accession: T00027 

R;Shiratsuchi, T.; Nishimori, H.; Ichise, H.; Nakamura, Y. ; Tokino, T. 
Cytogenet. Cell Genet. 79, 103-108, 1997 



A;Title: Cloning and characterization of BAI2 and BAI3, novel genes homologous 

to brain-specific angiogenesis inhibitor 1 (BAI 1) . 

A; Reference number: Z14066; MUID : 98194217 ; PMID: 9533023 

A;Accession: T00027 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-1572 <SHI> 

A; Cross-references: UNIPROT: 060241; EMBL: AB005298 ; NID: g3021698 ; 

PIDN:BAA2 5362.1; PID:g3021699 

A; Experimental source: brain 

C; Genetics : 

A; Gene: GDB: BAI 2 

A; Cross-references: GDB: 9838089;. OMIM: 602683 
A; Map position: Ip35-lp35 

Query Match 5.7%; Score 274.5; DB 2; Length 1572; 

Best Local Similarity 19.2%; Pred. No. 2.8e-ll; 

Matches 176; Conservative 108; Mismatches 307; Indels 327; Gaps 38; 

Qy 173 PEGIPPAEVEWLRNEDLVDPSLDPNVY 1 T REH S L WRQARL 213 

I I I : :| I: I :| :| :| I I I I 

Db 271 PEEEPKVKTQWPRSAD EPGLYMAQTGDPAAEEWSPWSVCSLTCGQGLQVR-TRS 323 

Qy 214 ADTANYTCVAKNIVARRRSASAAVIWWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 273 

: : T : : I : : I : I I I I I : I I I I I I I : I I : I 
Db 324 CVSSPYGTLCSGPLRETRPCNNSATCPVHGVWEEWGSWSLCSRSCGRGSRSRMRTCV — P 381 

Qy 274 PLNGGAFCEGQNVQKTACA-TLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPR- 328 

I : I I III : I I : I I I : I I I I I I I I I I : I I I 
Db 382 PQHGGKACEGPELQTKLCSMAACPVEGQWLEWGPWGPCSTSCANGTQQRSRKCSVAGPAW 441 

Qy 329 NGGEECQ 335 

I I : 

Db 442 ATCTGALTDTRECSNLECPATDSKWGPWNAWSLCSKTCDTGWQRRFRMCQATGTQGYPCE 501 

Qy 336 GTDLDTRNCTSDLC— VHSASGPEDVAL 361 

I I : : I : I I III 
Db 502 GTGEEVKPCSEKRCPAFHEMCRDEYVMLMTWKKAAAGEIIYNKCPPNASGSASRRCLLSA 561 

Qy 362 YVGL I AVAVCL VLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKA 414 

I I I : I I : I : : : I : : : I : : I I : : : 

Db 562 QGVAYWGLPSFARCISHEYRYLYLSLREHLAKGQRMLAGEGMSQWRS-LQELLARRTYY 620 

Qy 415 DNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLT NGHLLSPLGG 4 61 

I : : : | | : : | | | | : : : I I I 

Db 621 SGDLLFSvI)ILR^^OTDTFKRATWPSADDVQRFFQWSFMvT)AENKEKWDDAQQVSP--G 678 

Qy 462 GRHTLHHSSPTSEAEEFV SRLSTQNYFRSLPRG TSNMTYGTFN 504 

II I : I : I : I I I : I : I : : I : 

Db 679 SVHLLR WEDFIHLVGDALKAFQSSLI VTDNLVISIQREPVSAVSSDITFPMRG 732 

Qy 505 FLG GRLMI PNTGI SLLI P PDAIPRGK 530 

I I I : I : I I I I : I I 

Db 733 RRGMKDWVRHSEDRLFLPKEVLSLSSPGKPATSGAAGSPGRGRGPGTVPPGPGHSHQRLL 792 

Qy 531 ; IYE-IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVIL 574 

:| : I I I I III ::: |: || |:| 



Db 793 PADPDESSYEVIGAVLYRTLGLILPPP RPPLAVTSRVMT — VTVRPPTQPPAEPLIT 847 



Qy 575 A MDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQ-LEASACYV-- 626 

:: :l II : : I |: Mill: 
Db 848 VELSYIINGTTDPHCASWDYS-RADASSGDWD TENCQTLETQAAHTRC 894 

Qy 627 ETEQLGRFALVGE ALS VAAAKRLKLLLFAPVACT S LE YNI RVYCLHDTHDALKEV 681 

: I II::: | : | : : | : : I : I : I : : I I 
Db 895 QCQH L S T FAVLAQ P P KD LT L ELAG S P S VPLVT GCAVS CMALLT L LAI YA AFWRF 948 

Qy 682 VQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQ 741 

: : I : : I I I I : I : : : I I : : I 
Db 94 9 IKSERSI ILLNFCLSI — LASNI L I L VGQ S RVL S KGVCTMT A 988 

Qy 742 RYLHCTFTLERVSPSTSDLACKLWV WQVEGDG 773 

: I I I I : I I I : 

Db 989 AFLHFFF LSSFCWVLTEAWQSYLAVIGRMRTRLVRKRFLCLGWGLPALV 1037 

Qy 774 QSFSINFNITKDTRFAELLALESEAG-VPALVGPSA FKIPFLIRQKI IS 821 

: I : I I I : I I I : I I I I : I I : : I : II 

Db 1038 VAVSVGFTRTKGYGTSSYCWLSLEGGLLYAFVGPAAVIVLVNMLIGIIVFNKLMARDGIS 1097 

Qy 822 SLDPPCRRGAD WRTL 836 

I I:: I :| 
Db 1098 DKSKKQRAGSERCPWASL 1115 

RESULT 11 
A40558 

thrombospondin 1 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 05-Jun-1992 #sequence_revision 05-Jun-1992 #text_change 09-Jul-2004 
C;Accession: A40558; A37905; B42587; S68787 

R;Lawler, J.; Duquette, M. ; Ferro, P.; Copeland, N.G.; Gilbert, D.J.; Jenkins, 
N.A. 

Genomics 11, 587-600, 1991 

A;Title: Characterization of the murine thrombospondin gene. 

A; Reference number: A40558; MUID: 92128941; PMID: 1774063 

A; Accession: A40558 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-1170 <LAW> 

A; Cross-references: UNIPROT: P35441; GB:M62449; GB:M62450; GB:M62451; GB:M62452; 
GB:M62453; GB:M62454; GB:M62455; GB:M62456; GB:M62457; GB:M62458; GB:M62459; 
GB:M62460; GB:M62461; GB:M62462; GB:M62463; GB:M62464; GB:M62465; GB:M62466; 
GB:M62467; GB:M62468; GB:M62469; GB:M62470; NID:g511867; PIDN : AAA50611 . 1 ; 
PID:g511869 

R;Bornstein, P.; Alfi, D.; Devarayalu, S.; Framson, P.; Li, P. 
J. Biol.. Chem. 265, 16691-16698, 1990 

A; Title: Characterization of the mouse thrombospondin gene and evaluation of the 

role of the first intron in human gene expression. 

A;Reference number: A37905; MUID: 90375546; PMID:2398070 

A;Accession: A37905 

A; Status : preliminary 

A; Molecule type: DNA 

A; Residues: 1-490 <BOR> 



A; Cross-references: GB:J05605; GB:J05606; NID:g201991; PIDN: AAA40431 . 1 ; 
PID:g554390 

R;Laherty, CD.; O'Rourke, K. ; Wolf, F.W. ; Katz, R. ; Seldin, M.Fi; Dixit, V.M. 
J. Biol. Chem. 267, 3274-3281, 1992 

A;Title: Characterization of mouse thrombospondin 2 sequence and expression 
during cell growth and development. 

A; Reference number: A42587; MUID: 92147 683 ; PMID: 1371115 
A; Accession: B42587 

A; Status: preliminary; not compared with conceptual translation 
A; Molecule type: mRNA 

A;Residues: 1-1152 , 1 P f , 1154-1170 <LAH> 
A;Cross-ref erences : GB:M87276 

A; Note: sequence extracted from NCBI backbone (NCBIP : 81501 ) 
R;Chen, H.; Aeschlimann, D. ; Nowlen, J.; Mosher, D.F. 
FEBS Lett. 387, 36-41, 1996 

A;Title: Expression and initial characterization of recombinant mouse 
thrombospondin 1 and thrombospondin 3. 

A; Reference number: S68787; MUID: 96234 006; PMID: 8654563 

A; Accession: S 68 7 87 

A; Molecule type: protein 

A;Residues: 19-26, 1 X 28-37 <CHE> 

C; Complex: homotrimer, disulfide linked 

C; Superf amily : thrombospondin 1; EGF homology; thrombospondin type 1 repeat 

homology; von Willebrand factor type C repeat homology 

C; Keywords: calcium binding; glycoprotein; homotrimer 

F; 1^18/Domain: signal sequence #status predicted <SIG> 

F; 19-1170/Product : thrombospondin 1 #status predicted <MAT> 

F; 3 17- 375/ Domain : von Willebrand factor type C repeat homology <VWC> 

F; 37 8- 4 2 9/ Domain : thrombospondin type 1 repeat homology <THR1> 

F; 434-490/Domain: thrombospondin type 1 repeat homology <THR2> 

F; 491-547/Domain: thrombospondin type 1 repeat homology <THR3> 

F;551-586/Domain: EGF homology <EGF> 

F;248, 360, 708, 1067/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 5.6%; Score 270.5; DB 2; Length 1170; 

Best Local Similarity 32.2%; Pred. No. 3.7e-ll; 

Matches 57; Conservative 24; Mismatches 71; Indels 25; Gaps 5; 

Qy 207 WRQARLADT AN YT C VAKN I VAR RRSASAAVIVYVNGGWSTWTEWSVCSASC 258 

: : : I I : I I : : I : I : I I I I I : I I I I : I 

Db 399 IQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQ DGGWSHWSPWSSCSVTC 451 

Qy 259 GRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC 314 

II I I I : I : I I III: : II I I : : I I I I I I I : I 
Db 452 GDGVITRI RLCNS PSPQMNGKPCEGEARETKACKKDACPINGGWGPWS PWDI CSVTCGGG 511 

Qy 315 THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVC 371 

I I I I : : I I : I I : : I I : : I I II I III 

Db 512 VQRRS RLCNNPT PQFGGKDCVGDVTENQVCNKQDC PIDGCLSNPCFAGAKC 562 



RESULT 12 
TSHUP1 

thrombospondin 1 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 23-Aug-1987 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004 
C;Accession: A26155; A34274; A30140; A25812; A05172; A42927 



R;Lawler, J.; Hynes, R.O. 

J. Cell Biol. 103, 1635-1648, 1986 

A; Title: The structure of human thrombospondin, an adhesive glycoprotein with 
multiple calcium-binding sites and homologies with several different proteins. 
A; Reference number: A26155; MUID: 87057617 ; PMID:2430973 
A;Accession: A26155 
A; Molecule type: mRNA 
A; Residues: -1-1170 <LAW> 

A; Cross-references: UNIPROT : P07996; GB:X04665; NID:g37137; PIDN: CAA28370 . 1; 
PID:g37138 

A; Note: parts of this sequence, including the amino end of the mature protein, 
were determined by protein sequencing 
R;Laherty, CD.; Gierman, T.M.; Dixit, V.M. 
J. Biol. Chem. 264, 11222-11227, 1989 

A; Title: Characterization of the promoter region of the human thrombospondin 

gene. DNA sequences within the first intron increase transcription. 

A;Reference number: A34274; MUID: 89291870; PMID:2544587 

A;Accession: A34274 

A; Molecule type: DNA 

A; Residues: 1-166 <LAH> 

A; Cross-references : GB:J04835 

R;Hennessy, S.W.; Frazier, B.A. ; Kim, D.D.; Deckwerth, T.L.; Baumgartel, D.M.; 

Rotwein, P.; Frazier, W.A. 

J. Cell Biol. 108, 729-736, 1989 

A; Title: Complete thrombospondin mRNA sequence includes potential regulatory 
sites in the 3 f untranslated region. 

A; Reference number: A30140; MUID: 89139590; PMID:2918029 
A; Access ion: A3 0140 
A; Molecule type: mRNA 

A;Residues: 1-83, ' A' , 85-522 , 1 A' , 524-1170 <HEN> 

A; Cross-references: EMBL:X14787; NID:g37464; PIDN : CAA32889 . 1 ; PID:g37465 

A; Note: parts of this sequence, including the amino end of the mature protein, 

were determined by protein sequencing 

R;Kobayashi, S.; Eden-McCutchan, F. ; Framson, P.; Bornstein, P. 
Biochemistry 25, 8418-8425, 1986 

A; Title: Partial amino acid sequence of human thrombospondin as determined by 

analysis of cDNA clones: homology to malarial circumsporozoite proteins. 

A; Reference number: A25812; MUID: 87157592 ; PMID: 3030396 

A;Accession: A25812 

A;Molecule type: mRNA 

A;Residues: 1-83, 'A 1 , 85-397 <KOB> 

A; Cross-references: GB:M25631; NID:g538353; PIDN : AAA36741 . 1; PID:g538354 

R;Dixit, V.M.; Hennessy, S.W.; Grant, G.A. ; Rotwein, P.; Frazier, W.A. 

Proc. Natl. Acad. Sci. U.S.A. 83, 5449-5453, 1986 

A; Reference number: A05172; MUID: 86287276; PMID: 3461443 

A;Accession: A05172 

A; Molecule type: mRNA 

A;Residues: 1-83, 1 A 1 , 85-374 , 1 RC f <DIX> 

A; Cross-references: GB:M14326; NID:g340005; PIDN : AAA61237 . 1 ; PID:g553801 

A; Note: parts of this sequence, including the amino end of the mature protein, 

were determined by protein sequencing 

R;Sun, X.; Skorstengaard, K. ; Mosher, D.F. 

J. Cell Biol. 118, 693-701, 1992 

A; Title: Disulfides modulate RGD-inhibitable cell adhesive activity of 
thrombospondin . 

A;Reference number: A42927; MUID: 92348511; PMID:1379247 
A;Accession: A42927 



A; Molecule type: protein 
A; Residues: 987-1003 <SUN> 

A; Note: Cys-992 is shown to have a free sulfhydryl 
C; Genetics: 

A; Gene: GDB : THBS1; TSP1; TSP 

A; Cross-references: GDB: 120438; OMIM: 188060 

A; Map position: 15ql5-15ql5 

A;Introns: 23/1 

A; Note: the list of introns may be incomplete 
C; Complex: homotrimer, disulfide linked 
C; Function: 

A; Description: participates in cell migration and adhesion, and in platelet 
aggregation 

C; Super family : thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C; Keywords: beta-hydroxyasparagine; calcium binding; cell adhesion; 
glycoprotein; trimer 

F; 1-18/Domain: signal sequence #status predicted <SIG> 

F; 19-1 170/ Product : thrombospondin 1 #status predicted <MAT> 

von Willebrand factor type C repeat homology <VWC> 
thrombospondin type 1 repeat homology <THR1> 



F;317-375/Domain: 
F;378-429/Domain: 
F;434-490/Domain: 
F;491-547/Domain: 
F; 551-586/Domain: 
F; 650-689/Domain: 
F;926-928/Region: 



repeat homology <THR2> 
repeat homology <THR3> 



thrombospondin type 1 
thrombospondin type 1 
EGF homology <EGF1> 
EGF homology <EGF2> 
cell attachment (R-G-D) motif 
F; 171-232/Disulf ide bonds: #status predicted 

F;248, 360, 708, 1067/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F; 270, 274/Disulf ide bonds: interchain #status predicted 

F; 610/Modif ied site: erythro-beta-hydroxyasparagine (Asn) #status predicted 
F; 1051/Binding site: carbohydrate (Asn) (covalent) #status absent 



Query Match 5.6%; Score 268.5; DB 1; 

Best Local Similarity 32.9%; Pred. No. 5.1e-ll; 
Matches 51; Conservative 24; Mismatches 61; 



Length 1170; 
Indels 19; Gaps 



4; 



Qy 

Db 



207 WRQARLADTANYTCVAKNIVAR RRSASAAVIVYVNGGWSTWTEWSVCSASC 258 

: : : I I : I I : : I : I : I I I I I : I I I I : I 

399 IQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQ DGGWSHWSPWSSCSVTC 451 



Qy 

Db 



259 GRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC 314 

II I I I : I : I I III: : II I I : : I I I I I I I : I 

452 GDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGG 511 



Qy 

Db 



315 THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLC 34 9 

\ I I I I I I : I I : : I I : : I I 
512 VQKRS RLCNNPT PQFGGKDCVGDVTENQI CNKQDC 54 6 



RESULT 13 
A39804 

thrombospondin precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 09-Jul-2004 
C; Accession: A39804 

R;Lawler, J.; Duquette, M. ; Ferro, P. 
J. Biol. Chem. 266, 8039-8043, 1991 



A; Title: Cloning and sequencing of chicken thrombospondin. 
A; Reference number: A39804; MUID: 91217026; PMID:2022631 
A; Accession: A39804 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-1178 <LAW> 

A; Cross-references: UNIPROT : P35440 ; GB:M60853; NID: g212763; ' PIDN: AAA51437 . 1; 
PID:g212764 

C; Super family : thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
F;325-383/Domain: von Willebrand factor type C repeat homology <VWC> 
F;386-437/Domain: thrombospondin type 1 repeat homology <THR1> 
F;442-498/Domain: thrombospondin type 1 repeat homology <THR2> 
F; 4 9 9- 5 5 5/ Domain: thrombospondin type 1 repeat homology <THR3> 
F;658-697/Domain: EGF homology <EGF> 

Query Match 5.5%; Score 263; DB 1; Length 1178; 

Best Local Similarity 36.2%; Pred. No. 1.3e-10; 

Matches 58; Conservative 16; Mismatches 70; Indels 16; Gaps 5; 

Qy 210 QARLADTAN YT CVAKN I VARRRS - AS AAVI VYVNGGWSTWT EWS VC S AS CGRGWQKRS RS 268 

: I I I : I I I : : I I I I I : I I I I : I I I II 

Db 410 RGRSCDVTRSACTGPHIQTRMCSFKKCDHRI RQDGGWSHWSPWSSCSVTCGVGNITRIRL 469 

Qy 269 CTNPAPLNGGAFCEGQNVQKTACATL-CPVDGSWSPWSKWSACGLDC THWRSRECSD 324 

I : I I I I I I : I I I I : I I II I I I I I : I INI: 

Db 470 CN S P I PQMGGKN C VGN GRET E KC E KAP C P VN GQWG P W S P W S ACT VT C GGG I RE RS RL CN S 529 

Qy 325 PAPRNGGEECQGTDLDT RNCTSDLCVHSASGP 356 

I I : I I : I I II | : | | | : : | 

Db 530 PEPQYGGKPCVG DTKQHDMCNKRDCPIDGCLSNPCFP 566 



RESULT 14 
S29126 

properdin precursor [validated] - human 
N;Alternate names: factor P 
C; Species: Homo sapiens (man) 

C;Date: 17-Nov-2000 #sequence_revision 17-Nov-2000 #text_change 09-Jul-2004 
C;Accession: S29126; S16150; A05319; T45112; T45113 

R;Nolan, K.F.; Kaluz, S.; Higgins, J.M. G. ; Goundis, D. ; Reid, K.B.M. 
Biochem. J. 287, 291-297, 1992 

A; Title: Characterization of the human properdin gene. 

A; Reference number: S29126; MUID: 93038568; PMID: 1417780 

A;Accession: S29126 

A; Molecule type: DNA 

A; Residues: 1-469 <N0L1> 

A; Cross-references: UNIPROT : P27918 ; EMBL:X70872; NID:g35679; PIDN : CAA50220 . 1 ; 
PID:g35680 

R;Nolan, K.F.; Schwaeble, W.; Kaluz, S.; Dierich, M.P.; Reid, K.B.M. 
Eur. J. Immunol. 21, 771-776, 1991 

A; Title: Molecular cloning of the cDNA coding for properdin, a positive 
regulator of the alternative pathway of human complement. 
A;Reference number: S16150; MUID: 91184288; PMID:2009915 
A; Accession: SI 6150 
A;Molecule type: mRNA 

A; Residues: 1-456, ' R' , 458-469 <NOL2> 



A;Cross-references: EMBL:X57748 

R;Reid, K.B.M.; Gagnon, J. 

Mol. Immunol. 18, 949-959, 1981 

A; Reference number: A05319; MUID : 82195224 ; PMID: 7341961 
A; Access ion: A05319 
A;Molecule type: protein 

A; Residues: 28-53, »Q', 55-59, 'G' , 61, 1 1 1 , 63; 137-138 , 'P», 140-141, »P f ,143- 

144, f X', 146-148, *Y , ,150, 'S f ,152, 1 Y 1 , 154-156, ' XSXGXA' ; 162-163, 'EM65- 

172, 'X', 174-176, , X , ,178, 'V , 180; 223-228, 'X', 230-232, 1 GX 1 , 235-238 , 'GH',241- 

245; 248-251, »X», 253-257, , P , ,259, 'G',261, 'XPP ', 265-266, 1 X ' , 268-269 ; 280- 

285, »X\ 287-290, 'X',292, 'H 1 , 294-300, » SXXX» , 305-307, 'X 1 , 309-315, 1 K 1 , 317 ; 333- 

341,343-357, 'X 1 , 359-362, 1 EXE 1 ; 393-404, 1 QK* , 407 ; 421-427, f R', 429-443, , TKV',447- . 

448, 'XX 1 ,451, 'RX' ,454-455 <REI> 

R;Westberg, J.; Nordin-Fredrikson, G. ; Truedsson, L.; Sjoholm, A.G.; Uhlen, M. 
submitted to the EMBL Data Library, May 1997 
A; Reference number: Z22914 
A;Accession: T45112 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 

A;Residues: 1-54, ' X ? , 56-73, f X f , 75-99, f W 1 , 101-469 <WES1> 
A; Cross-references : EMBL : AF005665 ; PIDN : AAB63280 . 1 

A; Experimental source: genomic DNA from individual with properdin deficiency 
type II 

A;Accession: T45113 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;Residues: 1-60, f X f , 62-413, 1 D» , 415-452, f XX ! , 455-469 <WE2> 
A;Cross-references: EMBL : AF005666; PIDN : AAC51626 . 1 

A; Experimental source: genomic DNA from individual with properdin deficiency 
type III 

R;Hartmann, S.; Hofsteenge, J. 

J. Biol. Chem. 275, 28569-28574, 2000 

A;Title: Properdin, the positive regulator of complement, is highly C- 
mannosylated. 

A; Reference number: A59360; MUID: 20435812; PMID: 10878002 
A; Contents: annotation 

A; Note: identification and location of C-mannosylation sites by mass- 

spectroscopy ' 

C; Genetics : 

A; Gene: GDB: PFC 

A;Cross-references : GDB: 120275; OMIM: 312060 
A;Map position: Xpll . 3-Xpll . 23 

A;Introns: 26/1; 76/2; 135/1; 192/1; 256/1; 314/1; 378/1; 415/2 
C; Complex: a mixture of homodimers, homotrimers and homotetramers 
C; Function: 

A; Description: protects C3 convertase (C3bBb) from rapid inactivation 
A; Pathway: complement alternate pathway 

C; Super family: human properdin precursor; thrombospondin type 1 repeat homology 
C;Keywords: complement alternate pathway; glycoprotein; homodimer; homotetramer; 
homotrimer; plasma 

F; 1-2 7 /Domain : signal sequence #status predicted <SIG> 
F;28-469/Product : properdin jfstatus experimental <MAT> 
F; 76-128/Domain: thrombospondin type 1 repeat homology <THR1> 
F; 135-191/Domain: thrombospondin type 1 repeat homology <THR2> 
F; 192-2 55 /Domain : thrombospondin type 1 repeat homology <THR3> 
F;256-313/Domain: thrombospondin type 1 repeat homology <THR4> 
F;314-377/Domain: thrombospondin type 1 repeat homology <THR5> 



F; 37 8-4 40/ Domain : thrombospondin type 1 repeat homology <THR6> 

F; 83,86,139,142, 145, 196, 199, 260, 263, 321, 324, 382 , 385, 388/Modif ied site: 2»- 

mannosyl-tryptophan (Trp) #status experimental 

F ; 4 2 8 / Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 5.1%; Score 243; DB 1; Length 469; 

Best Local Similarity 39.5%; Pred. No. l.le-09; 

Matches 45; Conservative 14; Mismatches. 43; Indels 12; Gaps 4; 

Qy 243 GGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACAT — LCPVDGS 300 

I I I I I I I I : I : I : I I : I : I I I II III : III : I I I : 
Db 137 GGWSGWGPWEPCSVTCSKGTRTRRRACNHPAPKCGG-HCPGQAQESEACDTQQVCPTHGA 195 

Qy 301 WSPWSKWSACGLDC THWRSRECSDPAP — RNGGEECQGTDLDTRNCT 345 

I : I I : I I II I : I I I I : I : I I : I I I 

Db 196 WATWGPWTPCSASCHGGPHEPKETRSRKCSAPEPSQKPPGKPCPGLAYEQRRCT 249 



RESULT 15 
S05478 

properdin - mouse (fragment) 

C; Species: Mus musculus (house mouse) 

C;Date: 07-Sep-1990 #sequence_revision 07-Sep-1990 #text_change 09-Jul-2004 
C;Accession: S05478 
R;Goundis, D. ; Reid, K.B.M. 
Nature 335, 82-85, 1988 

A; Title: Properdin, the terminal complement components, thrombospondin and the 
circumsporozoite protein of malaria parasites contain similar sequence motifs. 
A; Reference number: S05478; MUID : 88318954 ; PMID: 3045564 
A;Accession: S05478 
A; Molecule type: mRNA 
A; Residues: 1-437 <GOU> 

A; Cross-references: UNIPROT : P11680; EMBL:X12905; NID:g53786; PIDN : CAA31389 . 1 ; 
PID:g53787 

C; Complex: a mixture of homodimers, homot rimers and homotetramers 
C; Function: 

A; Description: protects C3 convertase (C3bBb) from rapid inactivation 
A; Pathway: complement alternate pathway 

C; Super family : human properdin precursor; thrombospondin type 1 repeat homology 
C;Keywords: complement alternate pathway; glycoprotein; homodimer; homotetramer ; 
homot rimer; plasma 

F; 4 5- 97 /Domain : thrombospondin type 1 repeat homology <THR1> 

F; 104-160/Domain: thrombospondin type 1 repeat homology <THR2> 

F; 161-224/Domain: thrombospondin type 1 repeat homology <THR3> 

F;225-282/Domain: thrombospondin type 1 repeat homology <THR4> 

F;283-345/Domain: thrombospondin type 1 repeat homology <THR5> 

F; 346-408/Domain: thrombospondin type 1 repeat homology <THR6> 

F; 52, 55, 108, 111, 114, 165, 168, 229, 232, 290, 293, 350, 353, 356/Modif ied site: 2'- 

mannosyl-tryptophan (Trp) #status predicted 

F; 366, 396/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 4.8%; Score 229; DB 2; Length 437; 

Best Local Similarity 40.4%; Pred. No. 9.9e-09; 

Matches 46;. Conservative 10; Mismatches 46; Indels 12; Gaps 4; 



Qy 243 GGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACAT — LCPVDGS 300 

I I I I I I I I : I : I I I I I I I I I II II: I Ml II I : 



Db 106 GGWSEWGPWGPCSVTCSKGTQIRQRVCDNPAPKCGG-HCPGEAQQSQACDTQKTCPTHGA 164 



Qy 301 WSPWSKWSACGLDC THWRSRECSDPAPRN — GGEECQGTDLDTRNCT 345 

1:111 I I I I I I I I I : I : I I : : I : 

Db 165 WASWGPWSPRSGSCLGGAQEPKETRSRSCSAPAPSHQPPGKPCSGPAYEHKACS 218 
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ALIGNMENTS 



RESULT 1 
T32541 

unc-5 protein - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 29-Oct-1999 #sequence_revision 29-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T32541 
R;Latreille, P. 

submitted to the EMBL Data Library, December 1997 

A; Description: The sequence of C. elegans cosmid B0273. 

A;Reference number: Z21187 

A; Accession: T32541 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-919 <LAT> 

A; Cross-references : UNIPROT : 044171 ; EMBL:AF036698; PIDN : AAB88355 . 1 ; 
GSPDB:GN00022; CESP : B0273 . 4a 

A; Experimental source: strain Bristol N2; clone B0273 
C; Genetics: 

A; Gene: unc-5; CESP : B0273 . 4a 
A; Map position: 4 



A;Introns: 41/3; 108/1; 142/3; 201/1; 323/2; 553/1; 858/3 

C; Super family: unc-5 protein; immunoglobulin homology; SH3 homology; 

thrombospondin type 1 repeat homology 



Query Match 20.4%; Score 977; DB 2; Length 919; 

Best Local Similarity 28.7%; Pred. No. 2.3e-62; 

Matches 265; Conservative 168; Mismatches 379; Indels 110; Gaps 31; 

Qy 49 EPEDVTI VTCNKPVTjLVCKAVPATQIFFK^ — STDGSSGLPTMEVRI 106 

: I : I : : : I I I : I I : I I I : I : I I : : I : I I I : I I : I I : :: : 
Db 9 QPKSGYVIRNKPLRLQCRANHATKIRYKCSSKWID — DSRIEKLIGTDSTSGVGYIDASV 66 

Qy 107 NVS RQQVEKVFGLEE YWCQCVAWS S S G TTKSQKAYIRIARLRKNFEQEPLAKEVS 161 

: : I I I : : : : I I I I II I I : : I : I I : I : I : I : I 

Db 67 DI S RI DVDT S GHVDAFQCQC YA S GDDDQ DWAS DVATVHLAYMRKH FLKS P VAQ RVQ 123 

Qy 162 LEQGIVTiPCRPPEGIPPAEVEWLRNEDLVT)PSLDPbrvTITREHS 221 

: I I I : I I I I I : I : : : I I III : I I :: I I I : I : I I I I 
Db 124 EGTTLQLPCQAPESDPKAELTWYKDGVWQP — DANVTRASDGSLIMSAARLSDSGNYTC 181 

Qy 222 VAKNIVARRRSASAAVTVYVNGGWSTWTEW-SVCSASCG RGWQKR 265 

II: I : : I : I I : I I I I I : I I I I : I 

Db 182 EATNVANSRKTDPV^VQIYVDGGWSEWSPWIGTCHVDCPLLRQHAHRIRDPHDVLPHQRR 241 

Qy 266 SRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDP 325 

: I : I I I I I I I I : I : I : :| I :|| II II MM I : I : I I : I 

Db 242 TRTCNNPAPLNDGEYCKGEEEMTRSCKVPCKLDGGWSSWSDWSACSSSCHRYRTRACTVP 301 

Qy 326 APRNGGEECQGTDLDTRNCTSDLCVHSASG — PEDVALWGLIAVAVCLVLLLLVLILVY 383 

I I I I : I I I I I : I : II : I I hi : : : : : I : I : 

Db 302 PPMNGGQPCFGDDLMTQECPAQLCTADSSRI VI SDTAWGSVASI FI VAS FI LAI LAMFC 361 

Qy 384 CR KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTI 422 

I • I • • • I • I • • I • • • • I • 

Db 362 CKRGNSKKSKPLKPQKMNSEKAGGIYYS EPPGVRRLLLEHQHGTLLGEKISSCSQYF 418 

Qy 423 -QPDLSTTTT TYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPT-SE 474 

I I : II : I II I I I I I II I : 

Db 419 EPPPLPHSTTLRSGKSAFSGYSSTRNAGSRAALIQECSSSSSGSGGKRTMLRTSSSNCSD 478 

Qy 475 AEEFVSRLSTQNYFRSLPRGTS-NMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YE 533 

: : : I II I : : I II : :| MM |: M 

Db 479 DDNYATLYDYMEDKSVLGLDTSQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM— 536 

Qy 534 IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV LLTRPVILAMDHCGEPSP-D 585 

Ml: : I : : I I I : : I I Mill::: | | II 

Db 537 LYLAVSDTLTDQPHLKPIESALSPVIVTGQCDVSMSAHDNILRRPVWS FRHCASTFPRD 596 

Qy 586 SWSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASA CYVFTEQLGRFAL 636 

M I : I I I I : : M M : : I I I M I I I I 

Db 597 NWQFTL — YADEGSGWQKAWIGEENLNTNMFVQFEQPGKKNDGFGWCHVWTYSLARLML 654 

Qy 637 VGEAL — SVAAAKRLKLLLFAPVACTSLE — YNIRVYCLHDTHDALKEWQLEKQLGGQL 692 

I | : :| I I I : I M I : : : : I I I I : M I : : I : I I M 

Db 655 AGH PRRN S L S AAKRVHLAVFGPT EMS AYRRP FELRVYCVPET GAAME S VWKQED — GS RL 712 

Qy 693 IQEPR — VLHFKDSYHNLRLSIHDV-PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 749 



: I : I : I I I : I I I I : I | : I I I I : 
Db 713 LCESNDFILNEKG NLCICIEDVIPGFSCDGPEWEISETQHRFV AQNGLHCSLK 766 

Qy 750 LERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAF 809 

: I : : : | : | : : : : : : : I I I : I 
Db 767 FRPKEINGSQFSTRVIVYQKASSTEPMVM — EVSNEPELYDATSEEREKGSVCV EF 820 

Qy 810 KIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKP— S PTAMI LNLWEARH 867 

::M :: :: III : I I I I I : I I I I : I I I I I I lll:::|:|||| 
Db 821 RLP FGVKDELARLLDMPNESHS DWRGLAKKLH YDRYLQFFAS FPDCS PTS LLLDLWEAS S 880 

Qy 868 FPNGN-LSQLAAAVAGLGQPDA 888 

: : I : : I : I i I 
Db 881 SGSARAVPDLLQTLRVMGRPDA 902 



RESULT 2 
B44294 

unc-5 protein, long form - Caenorhabditis elegans 
N; Contains: unc-5 protein, short form 
C; Species: Caenorhabditis elegans 

C;Date: 30-Apr-1993 #sequence_revision 28-Jul-1995 #text_change 09-Jul-2004 
C;Accession: B44294; T32540; A44294 

R;Leung-Hagesteijn, C; Spence, A.M.; Stern, B.D.; Zhou, Y. ; Su, M.W.; 
Hedgecock, E.M.; Culotti, J.G. 
Cell 71, 289-299, 1992 

A; Title: UNC-5, a transmembrane protein with immunoglobulin and thrombospondin 

type 1 domains, guides cell and pioneer axon migrations in C. elegans. 

A; Reference number: A44294; MUID : 93046629; PMID: 1384987 

A; Contents: variety Bergerac 

A; Accession: B44294 

A;Molecule type: DNA 

A; Residues: 1-947 <LEU> 

A; Cross-references: UNIPROT : 044171; GB:S47168; NID:g258527; PIDN : AAB23867 . 1 ; 
PID:g258529 

A;Note: sequence extracted f rom NCBI backbone (NCBIN : 116668, NCBIN: 116670, 
NCBIN:116672, NCBIN : 116674 , NCBIN: 116676, NCBIN: 116678, NCBIN: 116680, 
NCBIN: 116682, NCBIN: 116685, NCBIP : 118648 ) 

A; Note: authors translated the codon CTA for residue 642 as Val; sequence shown 
follows the authors * translation 

A; Note: mRNA lacking the first exon is equally prevalent 
R;Latreille, P. 

submitted to the EMBL Data Library, December 1997 

A; Description: The sequence of C. elegans cosmid B0273. 

A; Reference number: Z21187 

A;Accession: T32540 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 
A; Residues: 1-947 <LAT> 

A; Cross-references : EMBL: AF03 6698 ; PIDN :AAB8 8356. 1; GSPDB: GN00022 ; CESP : B0273 . 4b 

A; Experimental source: strain Bristol N2; clone B0273 

C; Genetics: 

A; Gene: unc-5 

A; Map position: 4 

A;Introns: 28/1; 69/3; 136/1; 170/3; 229/1; 351/2; 581/1; 886/3 
C; Function: 



A; Description: required for guidance of pioneering axons and cells migrating 
dorsally along the body wall; proposed to be a receptor on the surface of the 
motile cells 

C;Superfamily: unc-5 protein; immunoglobulin homology; SH3 homology; 
thrombospondin type 1 repeat homology 

C;Keywords: alternative splicing; duplication; glycoprotein; receptor; 
transmembrane protein 

F;30-947/Product: unc-5 protein, short form #status predicted <ALT> 
F; 4 6- 11 6/ Domain: immunoglobulin homology <IM1> 
F; 153-2 11 /Domain: immunoglobulin homology <IM2> 

F;229-300/Domain: thrombospondin type 1 repeat homology #status atypical <THR1> 
F;301-354/Domain: thrombospondin type 1 repeat homology <THR2>. 
F;365-390/Domain: transmembrane #status predicted <TMM> 
F;512-559/Domain: SH3 homology <SH3> 

F; 53-114, 65-112, 160-209/Disulf ide bonds: #status predicted 
F;206/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 20.4%; Score 977; DB 1; Length 947; 

Best Local Similarity 28.7%; Pred. No. 2.4e-62; 

Matches 265; Conservative 168; Mismatches 379; Indels 110; Gaps 31; 

Qy 49 E PE DVY I VKN K PVL LVC KAVP ATQ I F FKCN GEWVRQVDHVI E R — STDGS SGLPTMEVRI 106 

: I : I : : : | | | : | | : | | | : | : | | : : | : | || : | | : | | : : : : 
Db 37 QPKSGYVIRNKPLRLQCRANHATKIRYKCSSKWID— DSRIEKLIGTDSTSGVGYIDASV 94 

Qy 107 NVS RQQVEKVFGLEEYWCQCVAWS S SG TTKSQKAYIRIARLRKNFEQEPLAKEVS 161 

5:11 I : : : : I I I I II I I : : I : II : I : I : I : I 

Db 95 DISRIDVDTSGHVDAFQCQCYA SGDDDQDWASDVATVHLAYMRKHFLKS PVAQRVQ 151 

Qy 162 LEQGIVXPCRPPEGIPPAEVTCWLRNEDLVTDPSLDPNW 221 

: I I I : I I I I I : I : : : I I III : II:: I I I : I : I I I I 
Db 152 EGTTLQLPCQAPESDPKAELTWYKDGVWQP — DANVI RAS D GS L I MS AARL S D S GN YT C 209 

Qy 222 VAKNIVARRRSASAAVIVYVNGGWSTWTEW-SVCSASCG RGWQKR 265 

II: I :: I : I I : I I I I I : I I I I : I 

Db 210 EATNVANSRKTDPVEVQI YVDGGWS EWS PWI GTCHVDCPLLRQHAHRI RDPHDVLPHQRR 269 

Qy 266 SRSCTNPAPLNGGAFCEGQWQKTACATLCPVT)GSWSPWSKWSACGLDCTHWRSRECSDP 325 

: I : I I I I I I I I : I : I : : I I : I I I I I I I I I I I : I : I I : I 

Db 270 TRTCNNPAPLNDGEYCKGEEEMTRSCKVPCKLDGGWS SWSDWSACS S SCHRYRTRACTVP 329 

Qy 326 APRNGGEECQGTDLDTRNCTSDLCVHSASG — P EDVAL WGLI AVAVCLVLLLLVL I LVY 383 

I I I I : I I I I I : I : I I : I I hi : : : : : I : I : 

Db 330 P PMNGGQPCFGDDLMTQECPAQLCTADS SRI VI S DTAVYGSVAS I FI VAS FI LAI LAMFC 389 

Qy 384 CR KKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTI 422 

I : | : : : | : | : : | : : : : | : 
Db 390 CKRGNSKKSKPLKPQKMNSEKAGGIYYS EPPGVRRLLLEHQHGTLLGEKISSCSQYF 446 

Qy 423 -QPDLSTTTT T YQGS LCPRQDGP S PKFQLTNGHLLS PLGGGRHTLHHS S PT- SE 474 

I I : I I : I II I I I I I I I I : 

Db 447 EPPPLPHSTTLRSGKSAFSGYSSTRNAGSRAALIQECSSSSSGSGGKRTMLRTSSSNCSD 506 

Qy 475 AEEFVSRLSTQNYFRSLPRGTS-NMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYE 533 

: : : I II I : : I I I : : I I : : I I : I : 

Db 507 DDNYATLYDYMEDKSVLGLDTSQNIVAAQIDSNGARLSLSKSGARLIVPELAVEGEKM — 564 



Qy 

Db 



534 IYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGV LLTRPVILAMDHCGEPSP-D 585 

: I I : : I : : III:: I I : I I I | : : : I I I I 

565 LYIAVSDTLTDQPHLKPIESALSPVIVIGQCDVSMSAHDNILRRPVWSFRHCASTFPRD 624 



Qy 586 SWSLRLKKQSCEGS-WEDVLHLGEEAPSHLYYCQLEASA CYVFTEQLGRFAL 636 

: I I : | | | | : : : | | | : : | | I : I I I I I 

Db 625 NWQFTL — YADEGSGWQKAWIGEENLNTNMFVQFEQPGKKNDGFGWCHVMTYSLARLML 682 

Qy 637 VGEAL — SVAAAKRLKLLLFAPVACTSLE — YNIRVYCLHDTHDALKEWQLEKQLGGQL 692 

I I : : I I I I : I : I I : : : : I I I I : : I I : : I : I I : I 

Db 683 AGHPRRNSLSAAKRVHLAVFGPTEMSAYRRPFELRVYCVPETGAAMESVWKQED — GSRL 740 

Qy 693 IQEPR — VLHFKDSYHNLRLSIHDV-PSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFT 749 

: I : I : I I I : I I I I : I I : I III: 
Db . 741 LCESNDFILNEKG NLCICIEDVI PGFSCDGPEWEI SETQHRFV AQNGLHCSLK 794 

Qy 750 LERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAF 809 

•I •••1*1 • • ill : : | | | : J 
Db 795 FRPKEINGSQFSTRVIVYQKASSTEPMVM — EVSNEPELYDATSEEREKGSVCV EF 848 

Qy 810 KIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKP — SPTAMILNLWEARH 867 

: : I I : : : : III : I I I I I : I I I I : I I I I I I I I I : : : I : I I I I 
Db 849 RLP FGVKDELARLLDMPNESHSDWRGLAKKLH YDRYLQFFAS FPDCS PTS LLLDLWEAS S 908 



Qy 868 FPNGN-LSQLAAAVAGLGQPDA 888 

: : I : : I : I I I 
Db 909 S G S ARAVP DL LQT LRVMGRP DA 930 



RESULT 3 
T00026 

brain-specific angiogenesis inhibitor 1 - human 
N;Alternate names: BAI1 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C;Accession: T00026 

R;Nishimori, H.; Shiratsuchi, T.; Urano, T.; Kimura, Y. ; Kiyono, K. ; Tatsumi, 
K. ; Yoshida, S.; Ono, M. ; Kuwano, M. ; Nakamura, Y. 
submitted to the EMBL Data Library, June 1997 
A; Reference number: Z14064 
A;Accession: T00026 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: mRNA 
A; Residues: 1-1584 <NIS> 

A; Cross-references: UNIPROT:014514; EMBL: ABO 052 97 ; NID : dll75078 ; PID:dl024528 

A; Experimental source: brain 

C; Genetics : 

A; Gene : GDB : BAI 1 

A;Cross-references: GDB: 9838088; OMIM: 602682 
A; Map position: 8q24-8q24 

F;408-462/Domain: thrombospondin type 1 repeat homology <THR3> 

Query Match 6.2%; Score 298.5; DB 2; Length 1584; 

Best Local Similarity 33.5%; Pred. No. 5.2e-13; 

Matches 78; Conservative 35; Mismatches 91; Indels 29; Gaps 11; 



Qy 124 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAEVEW 183 



Db 



I I : I I I : : I I : : I I I : I I I I I Nil 
309 CNREACGPAGRTSSRSQSLRSTDARR REELGDEL QQFGFPA-PQTGDPAAE-EW 360 



Qy 



Db 



184 LRNEDLVDPSLDPNWITREHSLVWQARLADTA^ 243 

: II : I I I : : : I : | : | 

361 — SPWSVCSSTCGEGWQTR TRFCVSSSYSTQCSGPLREQRLCNNSAVCPVHG 410 



Qy 



244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCP VDG 299 




Db 



411 AWDEWSPWSLCSSTCGRGFRDRTRTCR — PPQFGGNPCEGPEKQTKFCNIALCPGRAVDG 4 68 



Qy 



Db 



300 SWSPWSKWSACGLDCT HWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLC 349 

:|: II Mil I : |:|||: |: II MM ::||:| I 

469 NWNEWSSWSACSASCSQGRQQRTRECNGPS— YGGAECQGHWVETRDCFLQQC 519 



RESULT 4 
TSHUP2 

thrombospondin 2 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 19-May-1995 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004 
C;Accession: A47379; A42173 
R;LaBell, T.L.; Byers, P.H. 
Genomics 17, 225-229, 1993 

A;Title: Sequence and characterization of the complete human thrombospondin 2 

cDNA: potential regulatory role for the 3 1 untranslated region. 

A; Reference number: A47379; MUID : 94010892 ; PMID: 8406456 

A;Accession: A47379 

A; Molecule type: mRNA 

A; Residues: 1-1172 <LAB> 

A; Cross-references: UNIPROT: P35442; GB:L12350; NID:g307505; PIDN: AAA03703 . 1; 
PID:g307506 

R; LaBell, T.L.; Milewicz, D.J.; Disteche, CM. ; Byers, P.H. 
Genomics 12, 421-429, 1992 

A; Title: Thrombospondin II: partial cDNA sequence, chromosome location, and 

expression of a second member of the thrombospondin gene family in humans. 

A; Reference number: A42173; MUID : 92217961 ; PMID: 1559694 

A; Accession: A42173 

A; Molecule type: mRNA 

A; Residues: 560-1172 <LA2> 

A; Cross-references : GB:M81339 

A; Experimental source: fibroblast 

A;Note: sequence extracted from NCBI backbone (NCBIN : 95091, NCBIP: 95096) 
C; Genetics : 

A; Gene: GDB:THBS2; TSP2 

A;Cross-references: GDB: 128789; OMIM: 188061 
A;Map position: 6q27-6q27 
C;Complex: homotrimer, disulfide linked 
C; Function: 

A; Description: participates in cell migration and adhesion, and in platelet 
aggregation 

C; Super family : thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C;Keywords: beta-hydroxyasparagine; calcium binding; cell adhesion; 
glycoprotein; trimer 

F; 1-18/Domain: signal sequence #status predicted <SIG> 
F;19-1172/Product: thrombospondin 2 ttstatus predicted <MAT> 



F;319-377/Domain: von Willebrand factor type C repeat homology <VWC> 
F; 3 8 0-4 31 /Domain: thrombospondin type 1 repeat homology <THR1> 
F; 4 3 6-4 92 /Domain: thrombospondin type 1 repeat homology <THR2> 
F;493-549/Domain: thrombospondin type 1 repeat homology <THR3> 
F;553-588/Domain: EGF homology <EGF1> 
F;652-691/Domain: EGF homology <EGF> 
F;928-930/Region: cell attachment (R-G-D) motif 

F; 151, 316, 330,457,584, 710, 1069/Binding site: carbohydrate (Asn) (covalent) 
#status predicted 

F; 167-226/Disulf ide bonds: #status predicted 

F;266, 270/Disulfide bonds: interchain #status predicted 

F; 612 /Modified site: erythro-beta-hydroxyasparagine (Asn) #status predicted 

Query Match 6.2%; Score 296.5; DB 1; Length 1172; 

Best Local Similarity 30.5%; Pred. No. 4.9e-13; 

Matches 78; Conservative 28; Mismatches 105; Indels 45; Gaps 9; 

Qy 209 RQARLADTANYTCVAKNIVARRRSASAA-VT 267 

: : I I : I I : : I I II : : I I I I I : I I I I : I I I II 

Db 403 QRGRSCDVTSNTCLGPSIQTRACSLSKCDTRIRQDGGWSHWSPWSSCSVTCGVGNITRIR 462 

Qy 268 SCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDCT HWRSRECS 323 

I : I I II I : I : II I I : I I I I I I I I I I I : I I : I I : 

Db 463 LCNSPVPQMGGKNCKGSGRETKACQGAPCPIDGRWSPWSPWSACTVTCAGGIRERTRVCN 522 

Qy 324 DPAPRNGGEECQGTDLDTRNCTSDLCVliSASGPEDVALWGLIAVAVCLVLLLLVLILVY 383 

11:11:11 : : I I III II 
Db 523 SPEPQYGGKACVGDVQERQMCNKRSC PVDGCLSNPCFPGAQC 564 

Qy 384 CRKKEGLDSDVADSSILTSGFQPVSI — KPSKADNPHLLTIQPDLSTTTT TYQ 434 

I 11:11.11 : :: : ||: :|: | 

Db 565 SSFPDGS-WSCGFCPVGFLGNGTHCEDLDECALVPDICFSTSKVPRCVNTQP 615 

Qy 435 GSLC PRQDGPSP 446 

II II I I 

Db 616 GFHCLPCPPRYRGNQP 631 



RESULT 5 
JC5928 

semaphorin F precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 10-Apr-1998 #sequence_revision 08-May-1998 #text_change 09-Jul-2004 
C; Accession: JC5928 

R;Simmons, A.D.; Pueschel, A.W. ; McPherson, J.D.; Overhauser, J.; Lovett, M. 
Biochem. Biophys . Res. Commun. 242, 685-691, 1998 

A; Title: Molecular cloning and mapping of human semaphorin F from the Cri-du- 
chat candidate interval . 

A; Reference number: JC5928; MUID : 98125554 ; PMID: 9464278 
A; Accession : JC5928 

A; Status: nucleic acid sequence not shown 
A; Molecule type: mRNA 
A; Residues: 1-1074 <SIM> 

A; Cross-references: UNIPROT :Q13591; GB:U52840; NID: g2772583; PIDN : AAC09473 . 1; 
PID:g2772584 

A; Experimental source: brain 



C; Comment: This protein disrupts normal brain development and leads to some of 
the features of Cri-du-chat. 
C; Genetics : 
A; Gene: semaf 

C;Superfamily: human semaphorin F; thrombospondin type 1 repeat homology 
F; 1-20 /Domain : signal sequence #status predicted <SIG> 
F;50-533/Domain: semaphorin #status predicted <SEM> 
F;840-896/Domain: thrombospondin type 1 repeat homology <THR3> 
F;971-993/Domain: transmembrane #status predicted <TMM> 

Query Match 6.1%; Score 293; DB 2; Length 1074; 

Best Local Similarity 45.8%; Pred. No. 7.8e-13; 

Matches 54; Conservative 11; Mismatches 49; Indels 4; Gaps 2; 

Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATL-CPVDG 299 

Ml M M M II 111:111111 II II::: I I I I I I I I 
Db 783 VNGAWSAWTSWSQCSRDCSRGIRNRKRVCNNPEPKYGGMPCLGPSLEYQECNTLPCPVDG 842 

Qy 300 SWSPWSKWSACGLDC THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSAS 354 

I I I I I : I I : I : I I I : I I I I I : I I : I : I I | 
Db 843 VWSCWSPWTKCSATCGGGHYMRTRSCSNPAPAYGGDICLGLHTEEALCNTQPCPESWS 900 



RESULT 6 
A42587 

thrombospondin 2 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 04-Mar-1993 #sequence_revision 18-Nov-1994 #text_change 09-Jul-2004 
C; Accession: A42587; A39851 

R;Laherty, CD.; O'Rourke, K. ; Wolf, F.W.; Katz, R. ; Seldin, M.F.; Dixit, V.M. 
J. Biol. Chem. 267, 3274-3281, 1992 

A;Title: Characterization of mouse thrombospondin 2 sequence and expression 
during cell growth and development. 

A; Reference number: A42587; MUID: 92147683 ; PMID: 1371115 
A; Accession: A42587 

A; Status: preliminary; not compared with conceptual translation 
A;Molecule type: nucleic acid 
A; Residues: 1-1172 <LAH> 

A; Cross-references: UNIPROT:Q03350; GB:L07803; GB:M87275; NID:g340421; 
PIDN:AAA53064.1; PID:g567241 

A;Note: sequence extracted f rom NCBI backbone (NCBIP : 81502 ) 

R;Bornstein, P.; O'Rourke, K. ; Wikstrom, K. ; Wolf, F.W. ; Katz, R. ; Li, P.; 

Dixit, V.M. 

J. Biol. Chem. 266, 12821-12824, 1991 

A; Title: A second, expressed thrombospondin gene (Thbs2) exists in the mouse 
genome. 

A; Reference number: A39851; MUID: 91302287 ; PMID: 1712771 
A;Accession: A39851 
A; Status : preliminary 
A; Molecule type: mRNA 
A; Residues: 1-873 <BOR> 

A; Cross-references: GB:M64866; NID:g201994; PIDN: AAA40432 . 1; PID:g201995 
C; Super family: thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C; Keywords: calcium binding; glycoprotein 

F; 319-37 7 /Domain : von Willebrand factor type C repeat homology <VWC> 
F; 380-431/Domain: thrombospondin type 1 repeat homology <THR1> 



F;436-492/Domain: thrombospondin type 1 repeat homology <THR2> 

F; 4 93-54 9/ Domain : thrombospondin type 1 repeat homology <THR3> 

F;553-588/Domain: EGF homology <EGF1> 

F;652-691/Domain: EGF homology <EGF> 

Query Match 6.1%; Score 293; DB 2; Length 1172; 

Best Local Similarity 38.0%; Pred. No. 8.7e-13; 

Matches 60; Conservative 22; Mismatches 66; Indels 10; Gaps 5; 

Qy 209 RQARLADTANYTCVAKNIVARRRS-ASAAVIVYVNGGW 267 

: • I I : I I : : I I I : I I I I I I : I I I I : I I I II 

Db 403 QRGRSCDWSNTCLGPSIQTRTCSLGKCDTRIRQNGGWSHWSPWSSCSVTCGVGNVTRIR 462 

Qy 268 SCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDCT HWRSRECS 323 

I : I I II I : I : I I I : I I I I I I I I I I I : I I I I I : 

Db 463 LCNSPVPQMGGKNCKGSGRETKPCQRDPCPIDGRWSPWSPWSACTVTCAGGIRERSRVCN 522 

Qy 324 DPAPRNGGEECQG — TD LDTRNCTSDLCVHSASGP 356 

I I: II::! I |: : |:| I I : : I 
Db 523 SPEPQYGGKDCVGDVTEHQMCNKRSCPIDGCLSNPCFP 560 



RESULT 7 
T18856 

angiogenesis inhibitor homolog - Caenorhabditis elegans 
C; Species: Caenorhabditis elegans 

C;Date: 15-Oct-1999 #sequence_revision 15-Oct-1999 #text_change 09-Jul-2004 
C;Accession: T18856; T24653 
R;McMurray, A. 

submitted to the EMBL Data Library, July 1995 
A;Reference number: Z19031 
A; Accession: T18856 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-1444 <WIL> 

A;Cross-references: UNIPROT : Q8MYA8 ; EMBL:Z50004; PIDN: CAA90293 . 1 ; GSPDB: GN00028 ; 
CESP:C02B4.1 

A; Experimental source: clone C02B4 
R;McMurray, A. 

submitted to the EMBL Data Library, July 1995 
A;Reference number: Z19917 
A; Accession: T24653 

A; Status: preliminary; translated from GB/EMBL/DDBJ 

A; Molecule type: DNA 

A; Residues: 1-1444 <WI2> 

A;Cross-references: EMBL:Z50006; PIDN: CAA90302 . 1 ; GSPDB : GN00028 ; CESP:C02B4.1 

A; Experimental source: clone T07C5 

C; Genetics: 

A; Gene: CESP:C02B4.1 

A;Map position: X 

A;Introns: 25/3; 70/3; 96/3; 139/3; 187/1; 234/2; 282/3; 376/2; 422/2; 478/3; 
509/3; 566/2; 625/1; 696/2; 786/3; 812/2; 878/3; 971/1; 1007/3; 1067/1; 1099/3; 
1180/3; 1273/2; 1305/1; 1363/1; 1388/2 



Query Match 5.8%; Score 276; DB 2; Length 1444; 

Best Local Similarity 27.2%; Pred. No. 2e-ll; 

Matches 73; Conservative 28; Mismatches 97; Indels 70; Gaps 12; 



Qy 123 WCQCVAWSSSGTTKSQKAYIRIARLRKNFEQ EPLAKEVSLEQGIVLPCRPPEGI 176 

I : : I 1 : : : I : I I : III : I I I 

Db 1134 WSEWSSWSAC S C FS LT S T RRRFCQWD PT VQ GFCAGAI LEQ IPCAPGSCS 1183 

Qy 177 PPAE VEW LRNEDLVDPSLDPNVYITREHSLWRQARLADTAN 218 

II II : I I : I II: 

Db 1184 PSAGGWSLWSEWSSCSKDCGDTGHQIRNRMCSEP IPSNRGAYCSG 1228 

Qy 219 YT CVAKNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 273 

I : II I : : : I : I I I : I I I I I : I I : I : I I I I 

Db 1229 YSFDQRPCVMDNVCSDEK VDGGWTDWTAWSECTDYCRNGHRSRTRFCANPK 1279 

Qy 274 PLNGGAFCEGQNVQKTAC — ATLCPV-DGSWSPWSKWSACGLDC- — THWRSRECSDPAP 327 

I Mill:: | I : I I I I I I I : I I I I I I I I 

Db 1280 PSQGGAQCTGSDFELNPCFDPARCHLRDGGWSTWSDWTPCSASCGFGVQTRDRSCSSPEP 1339 

Qy 328 RNGGEECQGTDLDTRNCTSDLCVHSASG 355 

: I I : I I I I I I : I 
Db 1340 K-GGQSCSGLAHQTSLCDLPACDHESDG 1366 

RESULT 8 
T00326 

hypothetical protein KIAA0550 - human 
C; Species: Homo sapiens (man) 

C;Date: 01-Feb-1999 #sequence_revision Ol-Feb-1999 #text_change 15-Mar-2004 
C;Accession: T00326 

R;Nagase, T.; Ishikawa, K. ; Miyajima, N . ; Tanaka, A.; Kotani, H. ; Nomura, N.; 
Ohara, 0. 

DNA Res. 5, 31-39, 1998 

A;Title: Prediction of the coding sequences of unidentified human genes. IX. The 
complete sequences of 100 new cDNA clones from brain which can code for large 
proteins in vitro. 

A;Reference number: Z14086; MUID: 98290545; PMID:9628581 
A; Accession: T00326 

A; Status: preliminary; translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-984 <NAG> 

A; Cross-references: EMBL: AB011122 ; NID: g3043623 ; PIDN: BAA25476 . 1; PID:g3043624 
A; Experimental source: brain 
C; Genetics : 
A;Note: KIAA0550 

F;344-398/Domain: thrombospondin type 1 repeat homology <THR3> 



Query Match . 5.7%; Score 275; DB 2; Length 984; 

Best Local Similarity 39.0%; Pred. No. 1.4e-ll; 

Matches 57; Conservative 20; Mismatches 53; Indels 16; 



Gaps 



6; 



Qy 

Db 

Qy 

Db 



220 TCVA KNIVARRRSASAAVIVTWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 273 

Ml: : I : : I : I I I : I I : I I : I I I I : I : I I I I 

317 TCVSPYGTHCSGPLRESRVCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCT — P 374 



274 



375 



PLNGGAFCEGQNVQKTAC-ATLCPVDGSWS PWS KWSACGLDC THWRSRECSDPAPRN 329 

I M Ml I I I I I I I I I I I I I : I I I I I : I : I : 

PQYGGRPCEGPETHHKPCNIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCT — AAAH 432 



Qy 

Db 



330 GGEECQGTDLDTRNCTSDLCVHSASG 355 

I I I I : I : : I I : I : I : I 
433 GGSECRGPWAESRECYNPEC — TANG 456 



RESULT 9 
T00028 

brain-specific angiogenesis inhibitor 3 - human 
N; Alternate names: BAI3 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C; Accession: T00028 

R;Shiratsuchi, T.; Nishimori, H.; Ichise, H.; Nakamura, Y. ; Tokino, T. 
Cytogenet. Cell Genet. 79, 103-108, 1997 

A;Title: Cloning and characterization of BAI2 and BAI3, novel genes homologous 

to brain-specific angiogenesis inhibitor 1 (BAI 1) . 

A; Reference number: Z14066; MUID: 98194217 ; PMID: 9533023 

A; Accession: TOO 02 8 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-1522 <SHI> 

A; Cross-references: UNIPROT: 060242; EMBL: AB005299; NID: g3021700; 

PIDN:BAA25363.1; PID:g3021701 

A; Experimental source: brain 

C; Genetics : 

A; Gene: GDB : BAI 3 

A; Cross-references: GDB: 9838090; OMIM: 602684 
A; Map position: 6ql2-6ql2 

F;344-398/Domain: thrombospondin type 1 repeat homology <THR3> 

Query Match 5.7%; Score 275; DB 2; Length 1522; 

Best Local Similarity 39.0%; Pred. No. 2.5e-ll; 

Matches 57; Conservative 20; Mismatches 53; Indels 16; Gaps 6; 

Qy 220 TCVA KNIVARRRSASAAVIVYVNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPA 273 

IN: : I : : I : I I I : I I : I I : I I I I : I : I I I I 

Db 317 TCVSPYGTHCSGPLRESRVCNNTALCPVHGVWEEWSPWSLCSFTCGRGQRTRTRSCT — P 374 

Qy 274 PLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPRN 329 

I M Ml I I I I I I I I I I I I I : I I I I I : I : I : 

Db 375 PQYGGRPCEGPETHHKPCNIALCPVDGQWQEWSSWSQCSVTCSNGTQQRSRQCT — AAAH 432 

Qy 330 GGEECQGTDLDTRNCTSDLCVHSASG 355 

II 11:1 ::| I : I :|:| 

Db 433 GGSECRGPWAESRECYNPEC— TANG 456 



RESULT 10 
T00027 

brain-specific angiogenesis inhibitor 2 - human 
N;Alternate names: BAI2 protein 
C; Species: Homo sapiens (man) 

C;Date: 22-Jan-1999 #sequence_revision 22-Jan-1999 #text_change 09-Jul-2004 
C;Accession: T00027 

R;Shiratsuchi, T.; Nishimori, H.; Ichise, H.; Nakamura, Y. ; Tokino, T. 
Cytogenet. Cell Genet. 79, 103-108, 1997 



A;Title: Cloning and characterization of BAI2 and BAI3, novel genes homologous 

to brain-specific angiogenesis inhibitor 1 (BAI 1) . 

A; Reference number: Z14066; MUID: 98194217 ; PMID: 9533023 

A; Accession : T00027 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: mRNA 
A; Residues: 1-1572 <SHI> 

A;Cross-references : UNIPROT : 060241; EMBL: AB005298 ; NID: g3021698 ; 

PIDN:BAA25362.1; PID:g3021699 

A; Experimental source: brain 

C; Genetics : 

A; Gene: GDB : BAI 2 

A;Cross-references: GDB: 9838089; OMIM: 602683 
A; Map position: Ip35-lp35 

Query Match 5.7%; Score 274.5; DB 2; Length 1572; 

Best Local Similarity 19.2%; Pred. No. 2.8e-ll; 

Matches 176; Conservative 108; Mismatches 307; Indels 327; Gaps 38; 

Qy 173 PEGIPPAEVEWLRNEDLVDPSLDPNVY 1 T REH S L WRQARL 213 

I I I : : I I : I : I : I : I I I I I 

Db 271 PEEEPKVKTQWPRSAD EPGLYMAQTGDPAAEEWSPWSVCSLTCGQGLQVR-TRS 323 

Qy 214 ADTANYTCVAKNIVARRRSASAAVIVYVNGGW 273 

: : I : : I : : I : I I I I I : I I I I I I I : I I : I 
Db 324 CVSSPYGTLCSGPLRETRPCNNSATCPVHGVWEEWGSWSLCSRSCGRGSRSRMRTCV — P 381 

Qy 274 PLNGGAFCEGQNVQKTACA-TLCPVDGSWSPWSKWSACGLDC THWRSRECSDPAPR- 328 

1:1! M I : I I : I I I : I I I I I I I I I I : i I I 
Db 382 PQHGGKACEGPELQTKLCSMAACPVEGQWLEWGPWGPCSTSCANGTQQRSRKCSVAGPAW 441 

Qy 329 NGGEECQ 335 

I I : 

Db 442 ATCTGALTDTRECSNLECPATDSKWGPWNAWSLCSKTCDTGWQRRFRMCQATGTQGYPCE 501 

Qy 336 GTDLDTRNCTSDLC — VH S AS G P EDVAL 361 

I I : : I : I I III 
Db 502 GTGEEVT^PCSEKRCPAFHEMCRDEYVMLMTWKKAAAGEIIYNKCPPNASGSAS RRCLLSA 561 

Qy 362 YVGLIAVAVCL VLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKA 414 

I I I : I I : I : : : I : : : I : : I I : : : 

Db 562 QGVAYWGLPSFARCISHEYRYLYLSLREHLAKGQRMLAGEGMSQVVRS-LQELLARRTYY 620 

Qy 415 DNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLT NGHLLSPLGG 4 61 

Is: : | | : : | | | | : : : I I I 

Db 621 S GDLL FS VD I LRNVT DT FKRAT YVP S ADDVQRFFQWS FMVDAENKEKWDDAQQVS P — G 678 

Qy 462 GRHTLHHSSPTSEAEEFV SRLSTQNYFRSLPRG TSNMTYGTFN 504 

II I : I : I : I I I : I : I : : I : 

Db 679 SVHLLR WEDFI HLVGDALKAFQS SLI VTDNLVI S IQREPVS AVS S DIT FPMRG 732 

Qy 505 FLG GRLMIPNTGISLLIP PDAIPRGK 530 

I I I : I : I I I I : I I 

Db 733 RRGMKDWVRHSEDRLFLPKEVLSLSSPGKPATSGAAGSPGRGRGPGTVPPGPGHSHQRLL 792 



Qy 



531 



I YE- 1 YLTLHKPEDVRLPLAGCQTLLS PI VS CGPPGVLLTRPVI L 5 7 4 
:| : I I I I III ::: I: II |:| 



Db 


793 


PADPDESSYFVTGAVLYRTLGLILPPP RPPLAVTSRVMT — VTVRPPTQPPAEPLIT 


847 


Qy 


575 


A MDHCGEPSPDSWSLRLKKQSCEGSWEDVXHLGEEAPSHLYYCQ-LEASACYV-- 

:| II : : | |: Mill: 
VELSYIINGTTDPHCASWDYS-RADASSGDWD TENCQTLETQAAHTRC 


626 


Db 


848 


894 


Qy 


627 


FTEQLGRFALVGE ALSVAAAKRLKLLLFAPVACTS LEYNI RVYCLHDTHDALKEV 

: 1 II::: 1 : 1 : : I : : 1 : 1 : 1 : : I 1 
QCQHLSTFAVLAQPPKDLTLELAGSPSVPLVIGCTVVSCMALLTLIiAIYA AFWRF 


681 


Db 


895 


948 


Qy 


682 


VQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQ 
: : 1 : : 1 1 1 1 : I : : : I I : : | 
IKSERSI ILLNFCLSI — LASNI L I LVGQ S RVL S KGVCTMT A 


741 


Db 


949 


988 


Qy 


742 


: 1 1 1 1 : 1 1 I : 
AFLHFFF LS S FCWVLTEAWQS YLAVT GRMRTRLVRKRFLCLGWGLPALV 


773 


Db 


989 


1037 


Qy 


774 


QSFSINFNITKDTRFAELLALESEAG-VPALVGPSA FKIPFLIRQKI IS 

MM II : 1 1 1 : 1 1 1 1 : 1 1 : : 1 : II 

VAVSVGFTRTKGYGTSSYC^LSLEGGT.TiYAFVRPAAVTVT.VMMT.T^TTVF^K'T MARnnT 


821 


Db 


1038 


1 0Q7 


Qy 


822 


SLDPPCRRGAD WRTL 836 




Db 


1098 


II:: 1 : 1 
DKSKKQRAGSERCPWASL 1115 





RESULT 11 
A40558 

thrombospondin 1 precursor - mouse 
C; Species: Mus musculus (house mouse) 

C;Date: 05-Jun-1992 #sequence_revision 05-Jun-1992 #text_change 09-Jul-2004 
C;Accession: A40558; A37905; B42587; S68787 

R;Lawler, J.; Duquette, M. ; Ferro, P.; Copeland, N.G.; Gilbert, D.J.; Jenkins, 
N.A. 

Genomics 11, 587-600, 1991 

A; Title: Characterization of the murine thrombospondin gene. 

A; Reference number: A40558; MUID: 92128941; PMID: 1774063 

A; Accession: A40558 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-1170 <LAW> 

A; Cross-references: UNIPROT : P35441; GB:M62449; GB:M62450; GB:M62451; GB:M62452; 
GB:M62453; GB:M62454; GB:M62455; GB:M62456; GB:M62457; GB:M62458; GB:M62459; 
GB:M62460; GB:M62461; GB:M62462; GB:M62463; GB:M62464; GB:M62465; GB:M62466; 
GB:M62467; GB:M62468; GB:M62469; GB:M62470; NID:g511867; PIDN: AAA50611 . 1 ; 
PID:g511869 

R;Bornstein, P.; Alfi, D.; Devarayalu, S.; Framson, P.; Li, P. 
J. Biol. Chem. 265, 16691-16698, 1990 

A;Title: Characterization of the mouse thrombospondin gene and evaluation of the 

role of the first intron in human gene expression. 

A; Reference number: A37905; MUID: 90375546; PMID:2398070 

A;Accession: A37905 

A; Status : preliminary 

A;Molecule type: DNA 

A; Residues: 1-490 <BOR> 



A; Cross-references: GB:J05605; GB:J05606; NID:g201991; PIDN: AAA40431 . 1 ; 
PID:g554390 

R;Laherty, CD.; O'Rourke, K.; Wolf, F.W. ; Katz, R. ; Seldin, M.F.; Dixit, V.M. 
J. Biol. Chem. 267, 3274-3281, 1992 

A; Title: Characterization of mouse thrombospondin 2 sequence and expression 
during cell growth and development. 

A; Reference number: A42587; MUID : 92147683 ; PMID: 1371115 
A; Accession: B42587 

A; Status: preliminary; not compared with conceptual translation 
A;Molecule type: mRNA 

A; Residues: 1-1152 ,» P ' , 1154-1170 <LAH> 
A;Cross-references : GB:M87276 

A;Note: sequence extracted from NCBI backbone (NCBIP : 81501 ) 
R;Chen, H.; Aeschlimann, D.; Nowlen, J.; Mosher, D.F. 
FEBS Lett. 387, 36-41, 1996 

A;Title: Expression and initial characterization of recombinant mouse 
thrombospondin 1 and thrombospondin 3. 

A; Reference number: S68787; MUID: 96234006; PMID: 8654563 

A; Accession: S68787 

A;Molecule type: protein 

A;Residues: 19-26, 'X 1 , 28-37 <CHE> 

C; Complex: homotrimer, disulfide linked 

C;Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C;Keywords: calcium binding; glycoprotein; homotrimer 
F;l-18/Domain: signal sequence #status predicted <SIG> 
F;19-1170/Product : thrombospondin 1 #status predicted <MAT> 
F;317-375/Domain: von Willebrand factor type C repeat homology <VWC> 
F;378-429/Domain: thrombospondin type 1 repeat homology <THR1> 
F;434-490/Domain: thrombospondin type 1 repeat homology <THR2> 
F;491-547/Domain: thrombospondin type 1 repeat homology <THR3> 
F;551-586/Domain: EGF homology <EGF> 

F;248, 360,708, 1067/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 5.6%; Score 270.5; DB 2; Length 1170; . 

Best Local Similarity 32.2%; Pred. No. 3.7e-ll; 

Matches 57; Conservative 24; Mismatches 71; Indels 25; Gaps 5; 

Qy 207 WRQARLADTAN YTCVAKN I VAR RRSASAAVIVYVNGGWSTWTEWSVCSASC 258 

: :: I I : I I :: I : | : I I I I I : I I I I : I 

Db 399 IQQRGRSCDSLNNRCEGSSVQTRTCHIQECDKRFKQ DGGWSHWSPWSSCSVTC 451 

Qy 259 GRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC 314 

II I I I : I : I I III: : II I I :: I I I I I I I : I 
Db 452 GDGVITRIRLCNSPSPQMNGKPCEGEARETKACKKDACPINGGWGPWSPWDICSVTCGGG 511 

Qy 315 THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGLIAVAVC 371 

I I I I : : I I : I I : : I I : : I I II | III 

Db 512 VQRRSRLCNNPTPQFGGKDCVGDVTENQVCNKQDC PIDGCLSNPCFAGAKC 562 



RESULT 12 
TSHUP1 

thrombospondin 1 precursor - human 
C; Species: Homo sapiens (man) 

C;Date: 23-Aug-1987 #sequence_revision 03-Aug-1995 #text_change 09-Jul-2004 
C;Accession: A26155; A34274; A30140; A25812; A05172; A42927 



R;Lawler, J.; Hynes, R.O. 

J. Cell Biol. 103, 1635-1648, 1986 

A; Title: The structure of human thrombospondin, an adhesive glycoprotein with 
multiple calcium-binding sites and homologies with several different proteins. 
A; Reference number: A26155; MUID: 87057617 ; PMID:2430973 
A; Accession: A26155 
A;Molecule type: mRNA 
A; Residues: 1-1170 <LAW> 

A; Cross-references: UNIPROT : P07996; GB:X04665; NID:g37137; PIDN : CAA28370 . 1; 
PID:g37138 

A; Note: parts of this sequence, including the amino end of the mature protein, 
were determined by protein sequencing 
R;Laherty, CD.; Gierman, T.M.; Dixit, V.M. 
J. Biol. Chem. 264, 11222-11227, 1989 

A; Title: Characterization of the promoter region of the human thrombospondin 

gene. DNA sequences within the first intron increase transcription. 

A;Reference number: A34274; MUID: 89291870; PMID:2544587 

A; Accession: A34274 

A; Molecule type: DNA 

A; Residues: 1-166 <LAH> 

A; Cross-references : GB: J04835 

R;Hennessy, S.W.; Frazier, B.A. ; Kim, D.D.; Deckwerth, T.L.; Baumgartel, D.M.; 

Rotwein, P.; Frazier, W.A. 

J. Cell Biol. 108, 729-736, 1989 

A; Title: Complete thrombospondin mRNA sequence includes potential regulatory 
sites in the 3 f untranslated region. 

A; Reference number: A30140; MUID: 89139590; PMID: 2918029 
A; Accession: A30140 
A; Molecule type: mRNA 

A;Residues: 1-83, 'A' , 85-522, 'A' , 524-1170 <HEN> 

A; Cross-references: EMBL: X14787 ; NID:g37464; PIDN: CAA32889 . 1; PID:g37465 

A; Note: parts of this sequence, including the amino end of the mature protein, 

were determined by protein sequencing 

R;Kobayashi, S.; Eden-McCutchan, F. ; Framson, P.; Bornstein, P. 
Biochemistry 25, 8418-8425, 1986 

A; Title: Partial amino acid sequence of human thrombospondin as determined by 

analysis of cDNA clones: homology to malarial circumsporozoite proteins. 

A; Reference number: A25812; MUID: 87157592 ; PMID: 3030396 

A;Accession: A25812 

A; Molecule type: mRNA 

A;Residues: 1-83, 'A 1 , 85-397 <KOB> 

A; Cross-references: GB:M25631; NID:g538353; PIDN:AAA36741 . 1; PID:g538354 

R; Dixit, V.M. ; Hennessy, S.W.; Grant, G.A. ; Rotwein, P.; Frazier, W.A. 

Proc. Natl. Acad. Sci. U.S.A. 83, 5449-5453, 1986 

A; Reference number: A05172; MUID: 86287276; PMID: 3461443 

A;Accession: A05172 

A; Molecule type: mRNA 

A;Residues: 1-83, ! A» , 85-374 , 1 RC ' <DIX> 

A; Cross-references: GB:M14326; NID:g340005; PIDN : AAA61237 . 1 ; PID:g553801 

A; Note: parts of this sequence, including the amino end of the mature protein, 

were determined by protein sequencing 

R;Sun, X.; Skorstengaard, K. ; Mosher, D.F. 

J. Cell Biol. 118, 693-701, 1992 

A; Title: Disulfides modulate RGD-inhibitable cell adhesive activity of 
thrombospondin . 

A; Reference number: A42927; MUID: 92348511; PMID: 1379247 
A;Accession: A42927 



A; Molecule type: protein 
A; Residues: 987-1003 <SUN> 

A;Note: Cys-992 is shown to have a free sulfhydryl 
C; Genetics : 

A;Gene: GDB: THBS1; TSP1; TSP 

A; Cross-references: GDB: 120438; OMIM: 188060 

A; Map position: 15ql5-15ql5 

A;Introns: 23/1 

A; Note: the list of introns may be incomplete 
C; Complex: homotrimer, disulfide linked 
C; Function: 

A; Description: participates in cell migration and adhesion, and in platelet 
aggregation 

C;Superfamily: thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
C;Keywords: beta-hydroxyasparagine; calcium binding; cell adhesion; 
glycoprotein; trimer 

F; 1-18/Domain: signal sequence #status predicted <SIG> 

F; 19-1170/Product : thrombospondin 1 #status predicted <MAT> 

F;317-375/Domain: von Willebrand factor type C repeat homology <VWC> 

F;378-429/Domain: thrombospondin type 1 repeat homology <THR1> 

F; 434-4 90/Domain: thrombospondin type 1 repeat homology <THR2> 

F;491-547/Domain: thrombospondin type 1 repeat homology <THR3> 

F;551-586/Domain: EGF homology <EGF1> 

F;650-689/Domain: EGF homology <EGF2> 

F;926-928/Region: cell attachment (R-G-D) motif 

F;171-232/Disulfide bonds: #status predicted 

F;248,360,708,1067/Binding site: carbohydrate (Asn) (covalent) #status predicted 
F;270,274/Disulfide bonds: interchain #status predicted 

F; 610/Modif ied site: erythro-beta-hydroxyasparagine (Asn) #status predicted 
F; 1051/Binding site: carbohydrate (Asn) (covalent) #status absent 



Query Match 5.6%; Score 268.5; DB 1; Length 1170; 

Best Local Similarity 32.9%; Pred. No. 5.1e-ll; 

Matches 51; Conservative 24; Mismatches 61; Indels 19; Gaps 4; 



Qy 


207 


WRQARLADTAN YT CVAKN I VAR RRSASAAVI VYVNGGW STWTEWSVCSASC 

: :: 1 I: 1 1 :: 1 :| : 1 1 1 1 1 : 1 1 1 1 : 1 
I QQRGRS CDSLNNRCEGS SVQT RTCHI QECDKRFKQ DGGWSHWS PWS S CS VTC 


258 


Db 


399 


451 


Qy 


259 


GRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTAC-ATLCPVDGSWSPWSKWSACGLDC 

II 1 1 1 : 1 : 1 1 IN: : II 1 1 :: 1 1 1 1 1 1 1 : 1 
GDGVITRI RLCNSP S PQMNGKPCEGEARETKACKKDACP INGGWGPWS PWDI CSVTCGGG 


314 


Db 


452 


511 


Qy 


315 


THWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLC 349 

1 1 1 1 : : t 1 : 1 1 : :| - 1 : : 1 1 
VQKRSRLCNNPTPQFGGKDCVGDVTENQICNKQDC 546 




Db 


512 





RESULT 13 
A39804 

thrombospondin precursor - chicken 
C; Species: Gallus gallus (chicken) 

C;Date: 10-Sep-1999 #sequence_revision 10-Sep-1999 #text_change 09-Jul-2004 
C;Accession: A39804 

R;Lawler, J.; Duquette, M. ; Ferro, P. 
J. Biol. Chem. 266, 8039-8043, 1991 



A; Title: Cloning and sequencing of chicken thrombospondin. 
A; Reference number: A39804; MUID: 91217026; PMID:2022631 
A;Accession: A39804 
A; Status : preliminary 
A;Molecule type: mRNA 
A; Residues: 1-1178 <LAW> 

A; Cross-references: UNIPROT : P35440; GB:M60853; NID:g212763; PIDN: AAA5 1437 . 1; 
PID:g212764 

C; Super family: thrombospondin 1; EGF homology; thrombospondin type 1 repeat 
homology; von Willebrand factor type C repeat homology 
F;325-383/Domain: von Willebrand factor type C repeat homology <VWC> 
F;386-437/Domain: thrombospondin type 1 repeat homology <THR1> 
F; 4 42-4 98 /Domain : thrombospondin type 1 repeat homology . <THR2> 
F;499-555/Domain: thrombospondin type 1 repeat homology <THR3> 
F;658-697/Domain: EGF homology <EGF> 

Query Match 5.5%; Score 263; DB 1; Length 1178; 

Best Local Similarity 36.2%; Pred. No. 1.3e-10; 

Matches 58; Conservative 16; Mismatches 70; Indels 16; Gaps 5; 

Qy 210 QARLADTANYTCVAKNIVARRRS-ASAAVIV 268 

: I I I : I I I :: I I I I I : I I I I : I I I II 

Db 410 RGRSCDVTRSACTGPHIQTRMCSFKKCDHRIRQDGGWSHWSPWSSCSVTCGVGNITRIRL 469 

Qy 269 CTNPAPLNGGAFCEGQNVQKTACATL-CPVDGSWSPWSKWSACGLDC THWRSRECSD 324 

I : I I II II : I I I I : I I I I I I I I I : I I I I I : 

Db 470 CNSPIPQMGGKNCVGNGRETEKCEKAPCPVNGQWGPWSPWSACTVTCGGGIRERSRLCNS 529 

Qy 325 PAPRNGGEECQGTDLDT RNCTSDLCVHSASGP 356 

I I : I I : I I II | : | | | : : | 

Db 530 PEPQYGGKPCVG DTKQHDMCNKRDCPIDGCLSNPCFP 566 



RESULT 14 
S29126 

properdin precursor [validated] - human 
N; Alternate names: factor P 
C; Species: Homo sapiens (man) 

C;Date: 17-Nov-2000 #sequence_revision 17-Nov-2000 #text_change 09-Jul-2004 
C;Accession: S29126; S16150; A05319; T45112; T45113 

R;Nolan, K.F.; Kaluz, S.; Higgins, J.M.G.; Goundis, D. ; Reid, K.B.M. 
Biochem. J. 287, 291-297, 1992 

A; Title: Characterization of the human properdin gene. 

A; Reference number: S29126; MUID : 93038568 ; PMID: 1417780 

A; Accession: S29126 

A;Molecule type:. DNA 

A; Residues: 1-469 <NOLl> 

A; Cross-references: UNIPROT : P2 7918 ; EMBL:X70872; NID:g35679; PIDN: CAA50220 . 1; 
PID:g35680 

R;Nolan, K.F.; Schwaeble, W. ; Kaluz, S.; Dierich, M.P.; Reid, K.B.M. 
Eur. J. Immunol. 21, 771-776, 1991 

A; Title: Molecular cloning of the cDNA coding for properdin, a positive 

regulator of the alternative pathway of human complement. 

A;Reference number: S16150; MUID : 91184288 ; PMID:2009915 

A; Accession: SI 6150 

A;Molecule type: mRNA 

A; Residues: 1-456, ' R 1 , 458-469 <NOL2> 



A;Cross-references : EMBL:X57748 

R;Reid, K.B.M.; Gagnon, J. 

Mol. Immunol. 18, 949-959, 1981 

A;Reference number: A05319; MUID: 82195224 ; PMID: 7341961 
A; Accession: A05319 
A;Molecule type: protein 

A; Residues: 28-53, 'Q' , 55-59, 'G' ,61, ' 1 », 63; 137-138, 'P', 140-141, , P , ,143- 

144, 'X ',146-148, 'Y',150, 'S',152, »Y',154-156, • XSXGXA' ; 162-163, 'E',165- 

172, 'X', 174-176, , X , ,178, 'V , 180; 223-228, 'X', 230-232, 1 GX 1 , 235-238, 'GH',241- 

245;248-251, 'X', 253-257, f P»,259, 'G»,261, 1 XPP ' , 265-266, ' X », 268-269; 280- 

285, 'X', 287-290, 'X',292, 'H 1 , 294-300, » SXXX' , 305-307, 'X r , 309-315, 1 K', 317; 333- 

341,343-357, 'X', 359-362, 1 EXE 1 ; 393-404, 1 QK' , 407; 421-427, *R f ,429-443, , TKV',447- 

448, 'XX',451, 'RX», 454-455 <REI> 

R;Westberg, J.; Nordin-Fredrikson, G.; Truedsson, L. ; Sjoholm, A.G.; Uhlen, M. 
submitted to the EMBL Data Library, May 1997 
A; Reference number: Z22914 
A; Accession: T45112 

A; Status: translated from GB/EMBL/DDBJ 
A;Molecule type: DNA 

A;Residues: 1-54 , 'X », 56-73, 1 X *, 75-99, 'W 1 , 101-469 <WES1> 
A; Cross-references: EMBL : AF005665; PIDN :AAB63280 . 1 

A; Experimental source: genomic DNA from individual with properdin deficiency 
type II 

A; Accession: T45113 

A; Status: translated from GB/EMBL/DDBJ 
A; Molecule type: DNA 

A;Residues: 1-60, ! X' , 62-413, 1 D 415-452, 'XX 1 , 455-469 <WE2> 
A; Cross-references: EMBL : AF005666; PIDN: AAC5 1626 . 1 

A; Experimental source: genomic DNA from individual with properdin deficiency 
type III 

R;Hartmann, S.; Hofsteenge, J. 

J. Biol. Chem. 275, 28569-28574, 2000 

A; Title: Properdin, the positive regulator of complement, is highly C- 
mannosylated . 

A;Reference number: A59360; MUID: 20435812 ; PMID: 10878002 
A; Contents: annotation 

A; Note: identification and location of C-mannosylation sites by mass- 

spectroscopy 

C; Genetics : 

A; Gene: GDB : PFC 

A; Cross-references: GDB: 120275; OMIM: 312060 
A; Map position: Xpll . 3-Xpll . 23 

A;Introns: 26/1; 76/2; 135/1; 192/1; 256/1; 314/1; 378/1; 415/2 
C; Complex: a mixture of homodimers, homotrimers and homotetramers 
C; Function: 

1: A; Description: protects C3 convertase (C3bBb) from rapid inactivation 
A; Pathway: complement alternate pathway 

C; Super family: human properdin precursor; thrombospondin type 1 repeat homology 
C;Keywords: complement alternate pathway; glycoprotein; homodimer; homotetramer ; 
homo t rimer; plasma 

F; 1-2 7 /Domain : signal sequence #status predicted <SIG> 
F;28-469/Product: properdin #status experimental <MAT> 
F;76-128/Domain: thrombospondin type 1 repeat homology <THR1> 
F; 13 5- 191 /Domain : thrombospondin type 1 repeat homology <THR2> 
F; 192-255/Domain: thrombospondin type 1 repeat homology <THR3> 
F;256-313/Domain: thrombospondin type 1 repeat homology <THR4> 
F;314-377/Domain: thrombospondin type 1 repeat homology <THR5> 



F;378-440/Domain: thrombospondin type 1 repeat homology <THR6> 

F;83,86, 139,142, 145, 196, 199, 260, 263, 321, 324,.382, 385, 388/Modif ied site: 2'- 

mannosyl-tryptophan (Trp) #status experimental 

F;428/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 5.1%; Score 243; DB 1; Length 469; 

Best Local Similarity 39.5%; Pred. No. l.le-09; 

Matches 45; Conservative 14; Mismatches 43; Indels 12; Gaps 4; 

Qy 243 GGWS TWT EWS VC S AS CGRGWQKRS RS CTN PAP LNGGAFCEGQNVQKTACAT — LCPVDGS 300 

Mill I II :l :| : I hi :||| II III : III :|| |: 
Db 137 GGWSGWGPWEPCSVTCSKGTRTRRRACNHPAPKCGG-HCPGQAQESEACDTQQVCPTHGA 195 

Qy 301 WSPWSKWSACGLDC THWRSRECSDPAP — RNGGEECQGTDLDTRNCT 345 

I : I I : I I I I I : I I I I : I : 1 I : I I I 

Db 196 WATWGPWTPCSASCHGGPHEPKETRSRKCSAPEPSQKPPGKPCPGLAYEQRRCT 249 



RESULT 15 
S05478 

properdin - mouse (fragment) 

C; Species: Mus mus cuius (house mouse) 

C;Date: 07-Sep-1990 #sequence_revision 07-Sep-1990 #text_change 09-Jul-2004 
C;Accession: S05478 
R;Goundis, D.; Reid, K.B.M. 
Nature 335, 82-85, 1988 

A; Title: Properdin, the terminal complement components, thrombospondin and the 
circumsporozoite protein of malaria parasites contain similar sequence motifs. 
A; Reference number: S05478; MUID: 88318954 ; PMID: 3045564 
A;Accession: S05478 
A; Molecule type: mRNA 
A; Residues: 1-437 <GOU> 

A; Cross-references: UNIPROT: P11680; EMBL:X12905; NID:g53786; PIDN : CAA31389 . 1; 
PID:g53787 

C; Complex: a mixture of homodimers, homot rimers and homotetramers 
C; Function: 

A; Description: protects C3 convertase (C3bBb) from rapid inactivation 
A; Pathway: complement alternate pathway 

C; Super family: human properdin precursor; thrombospondin type 1 repeat homology 
C;Keywords: complement alternate pathway; glycoprotein; homodimer; homotetramer; 
homotrimer; plasma 

F; 45-97/Domain: thrombospondin type 1 repeat homology <THR1> 

F; 104-160/Domain: thrombospondin type 1 repeat homology <THR2> 

F; 161-224/Domain: thrombospondin type 1 repeat homology <THR3> 

F;225-282/Domain: thrombospondin type 1 repeat homology <THR4> 

F;283-345/Domain: thrombospondin type 1 repeat homology <THR5> 

F; 346-408/Domain: thrombospondin type 1 repeat homology <THR6> 

F; 52, 55, 108, 111, 114, 165, 168, 229, 232, 290, 293, 350, 353, 356/Modif ied site: 2'- 

mannosyl-tryptophan (Trp) #status predicted 

F;366,396/Binding site: carbohydrate (Asn) (covalent) #status predicted 

Query Match 4.8%; Score 229; DB 2; Length 437; 

Best Local Similarity 40.4%; Pred. No. 9.9e-09; 

Matches 46; Conservative 10; Mismatches 46; Indels 12; Gaps 4; 



Qy 



243 GGW S TWT EW SVC S AS C GRGWQKRS RS CTN PAP LN GGAFC E GQNVQ KT ACAT — LCPVDGS 300 
I I M I I I I : I : I I I I I I I I I II II: I III II I : 



Db 106 GGWSEWGPWGPCSVTCSKGTQIRQRVCDNPAPKCGG-HCPGEAQQSQACDTQKTCPTHGA 164 

Qy 301 WSPWSKWSACGLDC THWRSRECSDPAPRN — GGEECQGTDLDTRNCT 345 

1:111 I 111 II III r | : | | : : I : 

Db 165 WASWGPWSPRSGSCLGGAQEPKETRSRSCSAPAPSHQPPGKPCSGPAYEHKACS 218 



Search completed: March 1, 2005, 09:07:16 
Job time : 38.0558 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



Title: 

Perfect score: 
Sequence: 

Scoring table: 



Searched: 



March 1, 2005, 09:06:01 ; Search time 172.833 Seconds 

(without alignments) 
1704.439 Million cell updates/sec 



US-10-624-932-2 
4791 

1 MAVRPGLWPALLGIVLAAWL . 



. AVAGLGQPDAGLFTVSEAEC 898 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 

1385339 seqs, 328044528 residues 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 



1385339 



Post-processing : 



Database : 



Minimum Match 0% 
Maximum Match 100% 
Listing first 45 summaries 



Published_Applications__AA: * 

1 : /cgn2_6/ptodata/l/pubpaa/US07_PUBCOMB . pep : * 

2: /cgn2_6/ptodata/l/pubpaa/PCT_NEW_PUB.pep:* 

3: /cgn2_6/ptodata/l/pubpaa/US06_NEW_PUB.pep:* 

4 : /cgn2_6/ptodata/l/pubpaa/US06_PUBCOMB.pep:* 

5: /cgn2_6/ptodata/l/pubpaa/US07_NEW_PUB.pep: * 

6: /cgn2_6/ptodata/l/pubpaa/PCTUS_PUBCOMB.pep: 

7: /cgn2_6/ptodata/l/pubpaa/US08_NEW_PUB.pep:* 

8: /cgn2_6/ptodata/l/pubpaa/US08_PUBCOMB.pep:* 

9 : / cgn2_6/ptoda ta/ 1/pubpaa/US 0 9A_PUBCOMB . pep : 

10: /cgn2_6/ptodata/l/pubpaa/US09B_PUBCOMB.pep 

11: /cgn2_6/ptodata/l/pubpaa/US09CJ?UBCOMB.pep 

12: /cgn2_6/ptodata/l/pubpaa/US09_NEW_PUB.pep: 

13: /cgn2_6/ptodata/l/pubpaa/US10A_PUBCOMB.pep 

14 : /cgn2_6/ptodata/l/pubpaa/US10B_PUBCOMB.pep 

15: /cgn2_6/ptodata/l/pubpaa/US10C_PUBCOMB.pep 

16: /cgn2_6/ptodata/l/pubpaa/US10D_PUBCOMB.pep 

17: /cgn2_6/ptodata/l/pubpaa/US10_NEW_PUB.pep: 

18: /cgn2_6/ptodata/l/pubpaa/USll_NEW_PUB.pep: 

19: /cgn2_6/ptodata/l/pubpaa/US60J4EW_PUB.pep: 

20: /cgn2_6/ptodata/l/pubpaa/US60_PUBCOMB.pep: 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 
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ALIGNMENTS 



RESULT 1 
US-09-918-779-2 

; Sequence 2, Application US/09918779 
; Publication No. US20030064369A1 



GENERAL INFORMATION: 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Taupier, Raymond 
Padigaru, Muralidhara 
Luca 
Steven 
Richard 
Bryan 



Rastelli, 
Spaderna, 
Shimkets, 
Zerhusen, 
Spytek, Kimberly 
Shenoy, Suresh 
Li, Li 

Gusev, Vladimir 
Grosse, William 
Alsobrook, John 
Lepley, Denise 
Burgess, Catherine 
Gerlach, Valerie 
Ellerman, Karen 
MacDougall, John 
Stone, David 
Smithson, Glennda 
TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 214 02-074 US 
CURRENT APPLICATION NUMBER: US/09/918,779 
CURRENT FILING DATE: 2001-07-30 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE:' 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,697 
PRIOR FILING DATE: 2000-08-16 
PRIOR APPLICATION NUMBER: 60/263,662 
PRIOR FILING DATE: 2001-02-01 
PRIOR APPLICATION NUMBER: 60/281,645 
PRIOR FILING DATE: 2001-04-05 
NUMBER OF SEQ ID NOS: 61 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 2 
LENGTH: 898 
TYPE: PRT 

ORGANISM: Homo sapiens 
US-09-918-779-2 



Query Match 



100.0%; Score 4791; DB 10; Length 898; 



Best Local Similarity 100.0%; Pred. No. 0; 

Matches 898; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



QY 
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MAVT^PGLWPALLGIVLAAWLRGSGAQQSAWANPVPGANPDL^ 


60 






t i i i i i i t i i i i i i i i i i i i t i i i i i i i i i i i i i i i i * i i i i i i i i i i i i i i i i i i i ■ i i 
M 1 II 1 1 1 II 1.1 1 1 II II 1 II 1 1 1 1 1 1 I I I 1 I I I | | | | | | | | | | M | | | | | | | | | | | | M 




Do 


1 


MAVRPGLWPALLGI VLAAWLRGS GAQQS ATYAN PVPGAN PDLLPH FL VEP EDVYI VKNKP 


60 


Qy 


61 


VXLVCKAVPATQIFFKCNGEWVRQVT) 


120 






1 1 M 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 | | | | | | | || | | | | | 




Db 


61 


VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGSSGLPTMEVRINVSRQQVEKVFGLE 


120 


Qy 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAE 


180 






illlllllltlllltllllllllllllllllllllllfllltllllJIIJitiLiiaiit 

M II 1 1 1 1 1 1 1 II 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


121 


E YWCQCVAWS S S GTTKSQKAYI RI ARLRKNFEQEPLAKEVSLEQGI VLPCRPPEGI PPAE 


180 


Qy 


181 


VT£WLRNEDLVT)PSLDPNWITREHSLvV^ 


240 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I I I I I | | | | M | || | | | | || | | | | | | | | | | | | | | 




Db 


181 


VEWLRNEDLVDP S LD PNVYI T REH S LWRQARLADTAN YTCVAKN I VARRRSASAAVTVY 


240 


Qy 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 






1 1 1 M 1 I.I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II II 1 M 1 1 1 1 1 1 1 1 1 1 II 




Db 


241 


VNGGWS TWT EWS VC S AS C GRGWQKRS RS CTN PAP LNGGAFC EGQNVQKTACATLCPVDGS 


300 


Qy 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 






1 M 1 II I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 || I I I I I I | | | | | || | || | | | | | | | | | | | | | | | 




Db 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 


Qy 


361 


LWGLIAVAVCLVLLLLVXILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 






1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LWGLIAVAVCLVXLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 


Qy 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I | 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 


Qy 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHK 


540 






1 1 1 1 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 I I I I I I | | | | | || | | | | | | | | | | | | | | 




Db 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 


540 


Qy 


541 


PEDWLPIAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 


Qy 


601 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSYAAAKRLKLLLFAPVACT 


660 






1 1 1 1 M 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 || | | | | | | | | 




Db 


601 


EDVIiHLGEEAPSHLYYCQLEASACWFTEQLGRFALVGEMSVAAAKRLKLLLFAPVACT 


660 


Qy 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 
i i i i i i i i i i i i i i i i i i i i i i \ \ i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i i • i i 


720 


Db 


661 


1 M M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 I I I 1 1 1 1 
SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 


Qy 


721 


KSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINF 


780 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


721 


KSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINF 


780 


Qy 


781 


N I T KDT RFAE LLALE S EAGVPALVG P S AFK I PFLIRQKIISSLDP PCRRGADWRT LAQKL 


840 



1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 



Db 781 NIT KDT RFAE LLALE S EAGVPALVG P S AFKI P FL I RQKI I S S L D P PC RRGADWRT LAQKL 840 



Qy 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 2 
US-10-624-932-2 

Sequence 2, Application US/10624932 
Publication No. US20040096877A1 
GENERAL INFORMATION: 
APPLICANT: Taupier, Raymond 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Rastelli, Luca 
APPLICANT: Spaderna, Steven 
APPLICANT: Shimkets, Richard 
APPLICANT: Zerhusen, Bryan 
APPLICANT: Spytek, Kimberly 
APPLICANT: Shenoy, Suresh 
APPLICANT: Li, Li 
APPLICANT: Gusev, Vladimir 
APPLICANT: Grosse, William 
APPLICANT: Alsobrook, John 
APPLICANT: Lepley, Denise 
APPLICANT: Burgess , Catherine 
APPLICANT: Gerlach, Valerie 
APPLICANT: Ellerman, Karen 
APPLICANT: MacDougall, John 
APPLICANT: Stone, David 
APPLICANT: Smithson, Glennda 

TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-074 US 
CURRENT APPLICATION NUMBER: US/10/624,932 
CURRENT FILING DATE: 2003-07-21 
PRIOR APPLICATION NUMBER: 09/918,779 
PRIOR FILING DATE: 2001-07-03 
PRIOR APPLICATION NUMBER: 60/221,409 
PRIOR FILING DATE: 2000-07-28 
PRIOR APPLICATION NUMBER: 60/222,840 
PRIOR FILING DATE: 2000-08-04 
PRIOR APPLICATION NUMBER: 60/223,752 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,762 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,770 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/223,769 
PRIOR FILING DATE: 2000-08-08 
PRIOR APPLICATION NUMBER: 60/225,146 
PRIOR FILING DATE: 2000-08-14 
PRIOR APPLICATION NUMBER: 60/225,392 
PRIOR FILING DATE: 2000-08-15 
PRIOR APPLICATION NUMBER: 60/225,470 
PRIOR FILING DATE: 2000-08-15 

Remaining Prior Application data removed - See File Wrapper or PALM. 
NUMBER OF SEQ ID NOS: 61 



; SOFTWARE: Patentln Ver, 2.1 
; SEQ ID NO 2 

LENGTH: 898 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-10-624-932-2 



Query. Match 100.0%; Score 4791; DB 15; Length 898; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 898; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


i 


MAWPGLWPALLGIV1W\WLRGSGAQQSATVANPVPGANPDLLPHFI.VEPEDWIVTOJKP 


60 






l|l||||||||Jt||4|ft4||lJ||l||||t||flCiiiiiifti , - 

1 1 1 M 1 1 1 1 1 1 ll 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


i 


MAVRPGLWPALLGIVLAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDVYIVKNKP 


60 


Qy 


61 


VTiLVCKAVPATQI FFKCNGEWVTWDHVI ERSTDGS SGLP™^ 


120 






1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


VLLVCKAVPATQ I FFKCNGEWVRQVDHVT ERST DGS S GL PTMEVRINVS RQQVEKVFGLE 


120 


Qy 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 


Qy 


181 


VT£WLRNEDLVT)PSLDPNWITREHSLVVRQARL 


240 






M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 




Db 


181 


VEWL RN E D LVD P S L D PNVY I T REH S LWRQARLADTAN YT CVAKN I VARRRS ASAAVT VY 


240 


Qy 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 


Qy 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 


Qy 


361 


LWGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 




Db 


361 


L YVGLI AVAVCLVLLLL VL I LVYCRKKEGLDS DVADS S I LTS GFQPVS I KPS KADN PHLL 


420 


Qy 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 






M 1 1 1 1 1 1 1 II II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 I I I I I I I I I I | I | | | | | || | | | || | | | 




Db 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 


Qy 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHK 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEI YLTLHK 


540 


Qy 


541 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 


Qy 


601 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


601 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


660 


Qy 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 



I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I II I I I I I 



Db 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 


Ov 

wy 


721 




/oO 






1 1 1 1 1 1 i 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I 1 I 1 1 l l l l > I l i i i i i i i i i i i i i i i i i i i 
1 1 M 1 1 i 1 M 1 1 II 1 1 1 II II II M 1 1 1 1 1 1 1 1 1 II 1 II II 1 1 1 I I I I 1 I I I I 1 1 I 1 1 M 




Db 


721 


KSKLLVS YQEI PFYHIWNGTQRYLHCTETLERVS PSTSDLACKLWVWQVEGDGQS ITS INF 


780 






NIT KDT RFAEL LAL E S EAGVPALVG P S AFKI PFLIRQKIISSLDP P CRRGADWRT LAQKL 


840 






1 1 1 1 1 1 i 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 it i 






7R1 
/ox 


NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 


Qy 


841 


HLDSHLS FFAS KPS PTAMI LNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I 1 1 1 1 1 1 1 




Db 


841 


HLDSHLS FFASKPS PTAMI LNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSE1AEC 898 



RESULT 3 
US-09-970-944-2 

; Sequence 2, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION : No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 2 

LENGTH: 899 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-970-944-2 



Query Match 98.1%; Score 4698.5; DB 10; Length 899; 

Best Local Similarity 98.7%; Pred. No. 0; 

Matches 888; Conservative 2; Mismatches 7; Indels 3; Gaps 3; 



Qy 


i 


MAVRPGLWPALLGIVTWVWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDVTIVICNKP 


60 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


i 


MAVT^PGLWPALLGIVXAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^ 


60 


Qy 


61 


VTiLVCKAVPATQIFFKCNGEWWQVDHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLE 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 I I I 




Db 


61 


VXLVCKAVTATQIFFKCNGEWWQVI)HVIERSTDGSSGEPTMEvT^INVSRQQV^KVFGLE 


120 


Qy 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I | | | | | | | | | 




Db 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 


Qy 


181 


VEW L RN E D LVD P S L D PNVYI T REH S L WRQARLADT AN YT C VAKN I VARRRS AS AAVI VY 


240 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 



Db 


181 


VEWLRNEDLVDP S LDPNVYI TREHS LWRQARLADTAN YTCVAKNIVARRRSASAAVIVY 


240 


Ov 


241 


v ii vjvjvw iJ l it i DiK O V ^0/w^UAunyi\i\kj l\o x IN tAT JulN UOLnJ. OLuylN V ^J\X /\w/\X XjVfXT V Lib 


z y y 






II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 If • 1 • 1 Ml 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II • 1 • 1 ill 




Db 


241 


WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQ^^VHDRTVSSLLVSVDG 


300 


Ov 


300 


SWS. PWSKT/ff^ Af , nT.nPTTTWR c ?RFP c >nPZXPRNnrZ'FT?PnnTnT nTPMPTCinT P'V/Trc'A cmDCnw 
kJMijrH >_> rvw tjj-\\s\j±jLj^ i nv» r\o r\iii^o u c r\C imn vj\jl_jll»^.yoX JJXjU X r\!N X OXJXjv* v no/iO bJTLLI V 








1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 




Db 


301 


SWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV 


360 




O \J \s 


.rvLi 1 V(JiiXnVn.VL.L VXjXjXjXj VXjX Xj V I V^KI\i\JlLrXjJJDU V/UJOO J. Xii OoT yr Viil\rbI\nJJW cri-Li 








I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

II 1 II 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 1 It 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


361 


AL YVGL I AVAVC LVL L L LVL I LVYC RKKEGLD S DVAD S S I LT S G FQ P VS I K P S KADN PHL 


420 


Ov 


4 9 n 


T TTO D T^T QTTTTT Vfl C C T ^DDAFirDC DVTTAT TMPUT T CDT rrrDUTT tttt c onmoi?A TPTTn r 

XjX X\£truuo X X X X X lyboxj^irKyUbirolr J\r y-Lx W^nljijoirijoovjKrix Lnrioor I bhiArjfc*r V 


4 /y 






1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I t l I I I I I I I i I I 
1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 M 1 1 1 II 1 1 | | || | || | | | | | ( | 




Db 


421 


LTIQPDLS-TTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEEV 


479 


Ov 

yy 




QRT CtTnWVTrD QT pDr* r F<SMM r PVr* r P'E r KTTrT /T*DT MT DMTr T QT T T "DDT^A T DDPVT VCTVT itit U 


boy 






1 1 I I 1 I t 1 1 1 1 1 1 t 1 I 1 1 1 t 1 1 1 1 1 1 1 1 t 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 I I 1 1 1 1 1 1 i i i i i i i 
1 II 1 1 1 1 II 1 1 1 M 1 II 1 1 i 1 1 1 II II II II 1 1 1 1 1 1 1 1 M 1 1 1! 1 1 II 1 1 1 1 1 It 1 I II 




Db 


480 


SRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLH 


539 


0\7 

vy 




J\r CjU VKLiFli/vj^y X XjJjoIt 1 V oU^Jr r b VXjJjI KJr VJL 


59y 






1 E 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 } 1 1 1 1 1 1 1 1 t 1 I 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II I 1 1 1 1 1 1 1 I 1 1 II 1 II 1 1 1 1 1 1 1 1 II I I 1 I I I I 




Db 


540 


KPEDVRLPLAGCQTLLS PI VS CGPPGVLLTRPVI LAMDHCGEPS PDSWS LRLKKQSCEGS 


599 


Ov 

wy 


D \J \J 


WT?_ 1"Y\/T HT f^FT A DCUT VVrHT PAQ VinTTffOT C D IT A T A YC TAT CirA A A VDT VT T T IT 1 A 


/•CO 
ODD 






II 1 1 1 1 1 1 1 1 1 i 1 1 1 ) 1 1 I 1 1 t 1 1 1 1 1 1 t I t I I I I t I I I I I I I 1 I I I I I I I i i i t i i i i 
II 1 1 1 1 1 1 1 II 11 1 I I II 1 II 1 1 1 II 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 I I I 




Db 


600 


WEQDVLHLGEEAPSHLYYCQLl^ACWFTEQLGRFALVGF^SVAAAKRLKLLLFAPVA 


659 


Ov 


\j _j _? 


^x oxjEj xin iavi v^xjnw x nu/AJ_»j\Ej v vyxjijJ\vxjoos^ijxyi!jxrKVXjrir i\Do xrliNXjKXjoXrllJ V z do 


*7 1 Q 






1 1 1 1 1 1 I l i t l l l l l l i i i i i i i i i t i t i i i i i i t i i i i i i t t i i i i i i i i i i i i i i i i i 
1 1 1 M II II II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


660 


CTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSS 


719 


Ov 

wy 


71 9 


xjwno xvxjxj v o xy CjJ.it n iniwwbi y^ 1 XjriL»x r x XjHiKvoit o x oijxxH.l»a.XjVV vwy VHioJJoyor ox 


1 to 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i \ 1 1 1 1 

II 1 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


720 


LWKS KLLVS YQE I P FYH I WNGTQRYLHCT FTLERVS P ST S DLACKLWVWQVEGDGQ S FS I 


779 


Ov 

yy 


77Q 


XT TTTvT TT'l^T^TDTTATT'T T 7S. T 1? QT? A H/DRT UPDC A XPVT D I?T T T>r\VT TOOT nDTD^DDPA T"\TYT r>rp t 7\ r\ 

in tin J. x ftiJXKr/\j^XjXj/\XjjiD£Al7v J\lrr XjlKyj\l±bbijUrrOKKbADWKxLAQ 


o o o 

OOO 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 i 1 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 i 1 1 1 1 1 1 i I 1 1 1 1 t 1 1 i I t 1 1 1 l 1 I I 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 




Db 


780 


N FN I T KDTRFAELLAL E S FAGVPALVG P S AFKI P FL I RQ KI ISSLDPPC RRGADWRT LAQ 


839 


Qy 


839 


KLHLDSHLSFFASKPSPTAMIl^LWEARH 


898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 I ! 1 1 t t 1 1 1 1 1 1 1 1 1 1 l 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 
1 M M 1 M M M M 1 1 II 1 II 1 1 It 1 1 1 1 1 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | 1 | | | | | | | 




Db 


840 


KLHLDSHLSFFASKPSPTAMILNLWEARHFPN 


899 


RESULT 4 








US-09-933- 


-261- 


-5 





Sequence 5, Application US/09933261 
Publication No. US20030040046A1 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
Leonardo, E. David 
Hink, Lindsay 
Masu, Masayuki 
Kazuko, Keino-Masu 



TITLE OF INVENTION: Netrin Receptors 
; NUMBER OF SEQUENCES: 8 

CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 

COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 

OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 

FILING DATE: 20-Aug-2001 

CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/808,982 

FILING DATE: <Unknown> 
ATTORNEY/AGENT INFORMATION: 

NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 

REFERENCE/ DOCKET NUMBER: UC96-217 
; TELECOMMUNICATION INFORMATION: 

TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
; SEQUENCE CHARACTERISTICS: 

LENGTH: 898 amino acids 
; TYPE: amino acid 

STRANDEDNESS: No. US20030040046A1 Relevant 

TOPOLOGY: No. US20030040046A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-09-933-261-5 

Query Match 96.8%; Score 4638; DB 10; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 



Qy 1 MAV^PGLWPALLGIVXAAWLRGSGAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKP 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 1 MAVRPGLWPVXLGIVLAAWLRGSGAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKP 60 

Qy 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS SGLPTMEVRINVSRQQVEKVFGLE 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I 
Db 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDS S SGLPTMEVRINVSRQQVEKVFGLE 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I 
Db 121 EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

Qy 181 VEWLRNEDLVT)PSLDPNVYITREHSLVWQARLADTANYTC^ 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 181 VEWLRNEDLWPSLDPNVYITREHSLVVRQARLADTANYTCVAKNIVARRRSTSAAVIVY 240 



Qy 241 WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPIJ^GGAFCEGQWQKTACATLCPVDGS 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

Qy 301 WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 360 

II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I : I : I I I I II I 

Db 301 WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 360 

Qy 361 LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 420 

I I : I I : I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 361 L YI GLVAVAVCLFLLLLALGL I YCRKKEGLDS DVAD SSI LT S GFQPVS I KP S KADNPHLL 420 

Qy 421 TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 480 

I I I I I I I I I I I I I I II I I I I I I I I I I I I I : I I I I I I I I I I I I II I I I I I I I M I : I! I 

Db 421 TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 480 

Qy 481 RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 540 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 481 RLSTQNYFRSLPRGTSNMAYGT FN FLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 540 

Qy 541 PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

I I I I I I I I I I I II I I I I : I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 

Db 541 PEDWLPLAGCQTLLSPWSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

Qy 601 EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 660 

I I I I I I I II : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I 

Db 601 EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 660 

Qy 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 661 SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 720 

Qy 721 KSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINF 780 

I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I : : I I I I I I I I : I I I I I I I I I I I I : I II 

Db 721 KSKLLVSYQEIPFYHIWNGTQQYLHCTFTLERINASTSDLACKVWVWQVEGDGQSFNINF 780 

Qy 781 NIT KDT RFAE LLALE S EAGVPALVG P S AFKI P FL I RQK 1 1 S S LD P PCRRGADWRTLAQKL 840 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I II I I I I I I I 

Db 781 NITKDTRFAELLALESEGGVPALVGPSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKL 840 

Qy 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
Db 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 5 

US-09-970-944-13 

; Sequence 13, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 



; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970, 944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patent In Ver. 2.1 

; SEQ ID NO 13 

LENGTH: 898 

TYPE : PRT 
; ORGANISM: Rattus norvegicus 
US-09-970-944-13 



Query Match 96.8%; Score 4638; DB 10; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 



Qy 


1 


MAVT^PGLWPALLGIvliAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^ 


60 






1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 I 1 1 1 1 1 1 1 




Db 


1 


MAVTRPGLWPVTjLGIVXAAWLRGSGAQQSATVANPW 


60 


Qy 


61 


VLLVCKAVPATQ I FFKCNGEWVRQVDHVT ERSTDGS S GLPTMEVRI NVS RQQVEKVFGLE 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


VXLVCKAVPATQIFFKCNGEWV^QVT)HVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLE 


120 


Qy 


121 


EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 


180 






1 M 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 

■ " » » » Illllllllllllll 1 ' 1 1 1 1 t 1 1 1 1 1 1 1 [ 1 1 1 1 ( 1 1 1 1 1 1 1 l 1 1 1 1 | | 




Db 


121 


EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVXPCRPPEGIPPAE 


180 


Qy 


181 


v™lrnedlvt)psldpnwitrehslwrqarladta^ 


240 






1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 II 1 II 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


V^WLRNEDLVTlPSLDPNvTITREHSLvVRQARLADTA^ 


240 


Qy 


241 


WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 






1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 II II II 1 1 II 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 


Qy 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 






II 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I : I Mlllill llli:|:tl Mill 




Db 


301 


WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 


360 


Qy 


361 


L WGLI AVAVCLVTiLLLVXI LVYCRKKEGLDSDVADS S I LTSGFQPVS I KPSKADNPHLL 


420 






11:11:11111.1 INI 1 1:111 III II MM Mil II III Mill II II III III II 




Db 


361 


LYI GLVAVAVCLFLLLLALGLI YCRKKEGLDSDVADS S I LTSGFQPVS I KPS KADNPHLL 


420 


Qy 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 






1 1 1 1 1 1 1 M 1 1 M M II 1 1 1 II 1 M II 1 1 : 1 1 M II 1 1 1 II II 1 II 1 M 1 1 1 II : II 1 




Db 


421 


TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 


480 


Qy 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 


540 






1 1 1 1 M II M 1 II 1 II II 1 II 1 II 1 II 1 II 1 M 1 II 1 1 II II II II II 1 1 1 1 1 1 II II 1 




Db 


481 


RL S TQN Y FRS L P RGT SNMAYGT FN FLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHK 


540 


Qy 


541 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 






1 1 1 M 1 II M II 1 II M : II II II 1 II M II II II II M II II II II 1 II II II II 1 1 M 




Db 


541 


PEDVRLPLAGCQTLLSPWSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 







c*Ltv ±inu^CjtlJ\ironLi iLyLCiAoAHVr 1 Ejyjj^Kt AljVCjCiAJjb VAAAKRLKLLLFAPVACT 


660 






1 1 1 1 1 1 t 1 1 • 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 I 1 1 1 1 1 1 I 1 1 I 1 1 . 1 | | 1 1 i i 1 I i 

1 i 1 1 1 M 1 1 . 1 1 I I I I 1 1 1 1 1 II II II 1 1 II II 1 1 1 1 II 1 M | | | | | : I I | 1 | | 1 | | J 




Db 


601 


EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 


660 


vy 


uul 


oJjCj liN J.r\v I K^unu 1 nUALiIxIW vyjjE4j\yjjVj(jyij±yrjlrKVljri£ ]\]Jb IrtN LKLblrilJVlrbbLW 


Ton 






1 1 1 1 1 1 1 1 1 i l l l l l i l l l l t l l i i i i i i i i i i t i t i i i i i i i i i t i i i i i i i i i i i i i i 
1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 II 1 II 1 II 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 


A.. 


791 


r\.b JMjjjVb i ytilFr irllWJN(jiyKYLriUI r I LhKVSFSl SDIiACKLWVWQVEGDGQSF SINF 


ion 

780 






1 1 1 1 | 1 I 1 1 1 1 1 1 1 1 1 I I I 1 1 , 1 1 1 I 1 1 1 1 | 1 , , llllllll.lllltlllllll.il! 

1 1 1 1 M 1 1 1 1 1 1 1 1 M 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 : : 1 II I II M : 1 1 II II II 1 II 1 - II 1 




Db 


721 


KSKLLVSYQEIPFYHIWNGTQQYLHCTFTLERINASTSDIACKVWVWQVEGDGQSFNINF 


780 


Qy 




N I T KDT RFAE L LAL E S EAGVP AL VG P S AFK IPFLIRQKIISSLDPPC RRG AD WRT LAQ KL 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 :| I I I I I 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


781 


NITKDTRFAELLALESEGGVPALVGPSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKL 


840 


Qy 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 




Db 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 


RESULT 6 








US-10-256- 


-702- 


-5 





Sequence 5, Application US/10256702 
Publication No. US20030059859A1 
GENERAL INFORMATION: 

APPLICANT: Tessier-Lavigne, Marc 
Leonardo , E. David 
Hink, Lindsay 
Masu, Masayuki 
Kazuko, Keino-Masu 
TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
STREET: 268 BUSH STREET, SUITE 3200 
CITY: SAN FRANCISCO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 10/256, 702 
FILING DATE: 27-Sep-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
FILING DATE: 20-Aug-2001 
APPLICATION NUMBER: 08/808,982 
FILING DATE: <Unknown> 
ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 



REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 5: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 898 amino acids 

; TYPE: amino acid 

STRANDEDNESS: No. US20030059859A1 Relevant 
TOPOLOGY: No. US20030059859A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
US-10-256-702-5 



Query Match 96.8%; Score 4638; DB 14; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 



Qy 


1 


MAWPGLWPALLGIVXAAWLRGSGAQQSATVANPVPGANPDLLPHFLVTS^ 


60 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


1 


MAVRP GLW P VLLGI VLAAWLRG S GAQQ S ATVAN P VP GANP DLL P H FLVE P E DVY I VKN K P 


60 


Qy 


61 


VXLVCKAVPATQIFFKCNGEWVT^QVT^HVIERSTDGSSGLPTMEWINVSRQQVEKVFGLE 


120 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


61 


VXLVCKAVTATQIFFKCNGEWVKQVT)HVI 


120 


Qy 


121 


EYWCQCVAWS S SGTTKSQKAYI RI ARLRKNFEQEPLAKEVS LEQGI VLPCRP PEGI PPAE 


180 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 




Db 


121 


EYWCQCVAWS S SGTTKSQKAYI RIAYLRKNFEQEPLAKEVSLEQGI VLPCRP PEGI PPAE 


180 


Qy 


181 


VEWLRNEDLVD P S LD PNVYI TREH S L VVRQARLADTAN YTCVAKN I VARRRS AS AAVT VY 


240 






M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 




Db 


181 


VEWLRNEDLVD P S LDPNVYI TREHS LWRQARLADTANYTCVAKNI VARRRS TS AAVT VY 


240 


Qy 


241 


VNGGWS TWT EW S VC S AS C GRGWQ KRS RS CTN P AP LN GGAFC EGQNVQ KT ACAT LC P VDGS 


300 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 


Qy 


301 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 


360 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 




Db 


301 


WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 


360 


Qy 


361 


LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 






11:11:111111 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LYIGLVAVAVCLFLLLLALGLIYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 


Qy 


421 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 




Db 


■ 421 


TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 


480 


Qy 


481 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 


540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 




Db 


481 


RLSTQNYFRSLPRGTSNMAYGT FN FLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 


540 


Qy 


541 


PEDVRLPLAGCQTLLS PI VSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEGSW 


600 






1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 




Db 


541 


PEDVRLPLAGCQTLLS PWSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEGSW 


600 





601 


RnVT.HT.nFFZXP^HT.VVf*nT.FA c »Ar , YVFTPnT.r;DRaT Vf^TTZlT Q\/ , a a APTDT VT T T FZVPVarT 
i-ti-J v JLiri J_ivjiiii_i/-vir jfiL x X L-yiid/wriVrf i vi i iZi\£ ±j\j i\C t-\ i ■ v ItTiM 1 1 o V/\rt/\f\.r\ijI\ljijXir r\T VrW*. 1 


DDU 






M 1 1 1 1 1 1 1 : 1 1 1 1 I I I I 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 I I I I I I I I I I I I I : I I I I 1 1 1 1 1 I 




Db 


601 


EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 


660 




661 




79n 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 


vy 


791 


K" ^JfT.TA/*? YOFT PPVT-TTTXTMnTn'OVT HPTTTT'T VDA/'QDClTQ'nT STWT TjJ\/I*7r^\rK , r , nr'/ 0 4C CCTXTTT 

i\d rvjjij vo x^Ij± rt inxwiNoiyi\iijriv-.i r l LijKvoro 1 olJJa/\^i\ijW vwy vEiLjUoyor oIIn r 


Ton 
/ o U 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 :: 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 II 1 1 1 1 : 1 1 1 




Db 


721 


KSKLLVSYQEIPFYHIWNGTQQYLHCTFTLERINASTSDl^CKVWVWQVEGDGQSFNINF 


780 


Ox/ 

yy 


7ft1 

/Ol 


NITKDTRFAELIALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 






7 B 1 


NITKDTRFAELLALESEGGVPALVGPSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKL 


840 


Qy 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 




Db 


841 


HLDSHLSFFA3KPSPTAMILNLWEAPJiFPNGNLGQLAAAVAGLGQPDAGLFTVSElAEC 898 


RESULT 7 








US-10-240- 


-154- 


-16 





Sequence 16,. Application US/10240154 
Publication No. US20030175741A1 



; GENERAL INFORMATION: 

; APPLICANT: Cochran et al . 

; TITLE OF INVENTION: SCHIZOPHRENIA RELATED GENES 

; FILE REFERENCE: CKFW-P01-006 

; CURRENT APPLICATION NUMBER: US/10/240,154 

; CURRENT FILING DATE: 2001-04-02 

; PRIOR APPLICATION NUMBER: PCT/GB01/014 86 

; PRIOR FILING DATE: 2001-04-02 

; NUMBER OF SEQ ID NOS : 34 

; SOFTWARE: Patentln version 3.2 

; SEQ ID NO 16 

LENGTH: 898 

TYPE: PRT 
; ORGANISM: Rattus sp. 
US-10-240-154-16 

Query Match 96.8%; Score 4638; DB 14; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 



Qy 1 MAVT^PGLWPALLGIvXAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^ 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 MAVT^PGLWPVTjLGIVXAAWLRGSGAQQSATVANPVPGANP 60 

Qy 61 VXLVCKAWATQIFFKCNGEWV^QvT^^ 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I 
Db 61 VXLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDS S SGLPTMEVRINVSRQQVEKVFGLE 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I II I I I I I I I I I 
Db 121 EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 



yy 


1 R1 


v ij« jjruNCjjjjj v utroij u c jn v ii i Krjfioij v VKy/\KXi/\JJi/\ivi.i 1 U V/vlvOJ J. VAKKK o Ao A/VV 1 V I 








1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


181 


VEWLImEDLVDPSLDP^rVYITREHSLVVRQARLADTANYTCVAKNIVARRRSTSAAVIVY 


240 


yy 


941 


VJNiaiaWo 1 W 1 Hi Wo VLo/voLbKbWyRKoKoL 1 JN JrAPLJNLjljAr v^EiGyN Vyj\l AUAI LUr VDGb 








i i i i i i i i i i i i i i i i i i i i i i m i i i i i l i i i i i i i i i i i i i i i i i i i i i i i i i i i i i l 




Db 


241 


VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 


300 


Qy 


^ni 

Jul 


WorWoRWoALbLULinWKbKlljtDUrArK^ DLD1 KNCTSDLCVHSA5GPEDVA 








II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 : 1 1 1 1 1 1 1 




Db 


301 


WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 


360 


yy 




Li I V {j Jj 1AV A V U Li V Jj li L Li V L 1 li V I U KKKkGLDbDVALJb bl L I SGFyPVSI KPSKADNPHLL 


420 






M : 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


361 


LYIGLVAVAVCLFLLLLALGLIYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


420 


Qy 


H.£ 1 


1 lyjrDLbl I I 1 I xyGSLCPRyDGPSPKFQLTNGnLLSPLGGGRHTLHHSSPTSEAEEFvS 


480 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 




Db 


421 


TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 


480 


Qy 


4 ft 1 
ft O 1 


KLb 1 yJM i r KbLrKGl oJMMl ibl cNc LGGRLMI FN 1 Gl SLLLPPDAl PRGKI YEI YLTLHK 


c a rv 

540 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


481 


RLSTQNYFRSLPRGTSNMAYGTFNFLGGRLMI PNTGISLLIPPDAI PRGKI YEI YLTLHK 


540 


Qy 




Tl "C TYl JTi T tv T TV /^P^rnT T O tv Tl fp /"»/*■• tv tv/"^ TT T m TV tit. t V t tv TkirTvTT/"»/"*T7» tv o tvt\ otvt rt t t>t t ^ tt r\. r* #-i ^.t 

PfcDVKLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLl^ 


600 






1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


541 


PEDVRLPLAGCQTLLSPWSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 


600 


Qy 


Dill 


V Ln LGb hiAir b HL I YCQ LEAS AC YVFTEy 


660 






1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 111:1111111111 




Db 


601 


EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 


660 


yy 


DDI 


b LEYN I RVYCLHDTHDALKEVVQLEKQLGGQLIQEPRVLHFKDS YHNLRLS I HDVPS SLW 


720 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


661 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 


Qy 


791 


AbivLLiVb xyblrr xnlWI>JGiyKxLrlL.l r i LERVbPS I SDLACKLVrNAVQVEGDGQSFSINF 


780 






1 1 1 1 1 1 1 1 1 1 I II 1 1 1 1 1 1 1 1 : I 1 1 1 1 1 1 1 1 1 : : 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 




Db 


721 


KSKLLVSYQEIPFYHIVmGTQQYLHCTFTLERINASTSDIACKVWVWQVEGDGQSFNINF 


780 


Qy 


"7 ft 1 


NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 






1 1 1 II 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 : 1 1 1 1 II IIIIIIIIIMI 




Db 


781 


NITKDTRFAELLALESEGGVPALVGPSAFKIPFLIRQKI IASLDPPCSRGADWRTLAQKL 


840 


Qy 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 




Db 


841 


HLD S H L S FFAS K P S P TAMI LN LWE1ARH FPN GNLGQ LAAAVAGLGQ P DAGL FTVS EAEC 898 



RESULT 8 
US-10-311-623-1 

; Sequence 1, Application US/10311623 
; Publication No. US20040023244A1 
; GENERAL INFORMATION: 

; APPLICANT: INCYTE GENOMICS, INC.; GRIFFIN, Jennifer A. 
; APPLICANT: KALLICK, Deborah A.; TRIBOULEY, Catherine M. 



APPLICANT: YUE, Henry; NGUYEN, Danniel B. 
APPLICANT: TANG, Y. Tom; LAL, Preeti G. 
APPLICANT: POLICKY, Jennifer L.; AZIMZAI, Yalda 
APPLICANT: LU, Dyung Aina M. ; GRAUL, Richard C. 
APPLICANT: YAO, Monique G. ; BURFORD, Neil 
APPLICANT: HAFALIA, April J. A.; BAUGHN, Marian R. 
APPLICANT: BANDMAN, Olga; ARVIZU, Chandra S. 
APPLICANT: YANG, Junming; XU, Yuming 
APPLICANT: GANDHI, Ameena R. ; WARREN, Bridget A. 
APPLICANT: DING, Li; SANJANWALA, Madhusudan M. 
APPLICANT: DUGGAN, Brendan M. ; LU, Yan 
TITLE OF INVENTION: RECEPTORS 
FILE REFERENCE: PF-0793 USN 
CURRENT APPLICATION NUMBER: US/10/311,623 
CURRENT FILING DATE: 2002-12-17 
PRIOR APPLICATION NUMBER: US 01/19942 
PRIOR FILING DATE: 2001-06-21 
PRIOR APPLICATION NUMBER: US 60/214,027 
PRIOR FILING DATE: 2000-06-21 
PRIOR APPLICATION NUMBER: US 60/228,045 
PRIOR FILING DATE: 2000-08-25 
PRIOR APPLICATION NUMBER: US 60/255,104 
PRIOR FILING DATE: 2000-12-12 
NUMBER OF SEQ ID NOS : 24 
SOFTWARE: PERL Program 
SEQ ID NO 1 
LENGTH: 842 
TYPE: PRT 

ORGANISM: Homo sapiens 
FEATURE: 

NAME/KEY: misc_f eature 

OTHER INFORMATION: Incyte ID No. US20040023244A1 6052371CD1 
US-10-311-623-1 



Query Match 92.1%; Score 4413; DB 15; Length 842; 

Best Local Similarity 93.5%; Pred. No. 0; 

Matches 840; Conservative 1; Mismatches 1; Indels 56; Gaps 1; 



Qy 1 MAWPGLWPALLGIVXAAWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDVTIVKNKP 60 

I I I I I I I I I I II I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I I I I I I I 

Db 1 MAVRP GLW PALLGI VLAAWLRG S GAQQ SAT VAN P VP GAN P D LL P H FLVE P EDVYI VKN KP 60 

Qy 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS SGLPTMEVRINVSRQQVEKVFGLE 120 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 61 VLLVCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGS SGLPTMEVRINVSRQQVEKVFGLE 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 121 EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

Qy 181 VEWLRNEDLVT>PSLDPNVYITREHSLVVRQARLADTA^ 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

Db 181 V^WLRNEDLVT^PSLDPNWITREHSLWRQARI^TANYTCVAKNIVARRRSASAAVIW 240 



Qy 241 VNGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

I I I I 

Db 241 VDGS 244 









oDU 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 




Db 


245 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVA 


304 


yy 




t v\/t*t t 7±\m\rr' t "\tt t t t i/t tt \rvr*X3wvt* , m ncniranccn TcrTnDTrcTVDcvnnMDUT t 
1j I VoijXAVAV^ljVijJjJjJjVijl Li V I^Ki\J\CjCjJjUoU VADo oXJj 1 oGfc y F VbIKFblYAJJ.N FnLL 


/ion 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


305 


LYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 


364 


Qy 


fl/l J. 


TTADnT Q rnmmmmvnr CT PDO/^HrDO D VTPr\T Tiki f"*T-I T T COT /""*/"' fTJTjrpT UUC C DTCT?AT?P CT/O 

1 lyxrUliOl 1111 iyboljLrKyiJbrorrvcyLilNbnLLbrLbb^ Vb 


/ton 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


365 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


424 


Qy 


A Q 1 
ft O ± 


KLio 1 yiM jl r Kblif Kbl oWMl ivjI rNr LCjGKLiMIFN IGIbLLlPPDAIPRGKI YEI YLTLHK 


04U 






1 1 1 1 1 1 1 1 ii 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


425 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEI YLTLHK 


484 


Qy 


Dfl X 


FhDVKLPLAGCQ 1 LLb PI VSCGPPGVLLTRPVI LAMDHCGEPS PDSWSLRLKKQSCEGSW 


600 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


485 


PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 


544 


Qy 




EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


660 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


545 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


604 


Qy 


DDI 


SLEYNIRviCLHDTHDALKEVVQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


720 






1 1 1 1 ii 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


605 


SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 


664 


Hit 

Qy 


X 


KSKLLVS YQEI P FYH I WNGTQRYLHCT F TLERVS PSTS DLACKLWVWQVEGDGQS FS INF 


780 






II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


665 


KSKLLVS YQEI P FYH I WNGTQRYLHCT FTLERVS PST S DLACKLWVWQVEGDGQS FS INF 


724 


A,, 

Qy 


/ o 1 


NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


725 


NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


784 


Qy 


841 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


785 


H LD S H L S FFAS KP S PTAMI LNLWEARH FPNGN L S QLAAAVAGLGQ P DAGL FT VS EAEC 842 



RESULT 9 

US-09-970-944-14 

; Sequence 14, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 



; PRIOR FILING DATE: 2000-10-04 
; NUMBER OF SEQ ID NOS : 62 ... 
; SOFTWARE: Patentln Ver. 2.1 
; SEQ ID NO 14 

LENGTH: 544 

TYPE: PRT 
; ORGANISM: Homo sapiens 
US-09-970-944-14 



Query Match 59.4%; Score 2845; DB 10; Length 544; 

Best Local Similarity 100.0%; Pred. No. 2.7e-225; 

Matches 541; Conservative 0; Mismatches 0; Indels 0; Gaps 0; 



Qy 


358 


DVAL YVGL I AVAVCLVLLL LVL I LVYC RKKEGL D S DVAD S S I LT S GFQ PVS I KP S KADN P 


417 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


4 


DVAL YVGLI AVAVCLVLLLLVL I LVYCRKKEGLDS DVADS S I LT S GFQ PVS I KP S KADN P 


63 


Qy 


418 


HLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEE 


477 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


64 


HLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEE 


123 


Qy 


478 


FVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLT 


537 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


124 


FVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLT 


183 


Qy 


538 


LHKPEDVIUjPLAGCQTLLSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRLKKQSCE 


597 






1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


184 


LHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCE 


243 


Qy 


598 


GSWEDVXHLGEE^SHLYYCQLEASACYVTTEQLGRFALVGEALSVAAAKRLKLLLFAPV 


657 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


244 


GSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPV 


303 


Qy 


658 


ACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPS 


717 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


304 


ACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPS 


363 


Qy 


718 


SLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFS 


777 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


364 


SLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFS 


423 


Qy 


778 


I N FN I T KDT RFAELLALE S EAGVP ALVGP S AFK I P FL I RQK 1 1 S S LD P PC RRGADWRTLA 


837 






1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


424 


INFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLA 


483 


Qy 


838 


QKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 


897 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


484 


QKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 


543 


Qy 


898 


C 898 




Db 


544 


1 

C 544 





RESULT 10 
US-09-933-261-6 

; Sequence 6, Application US/09933261 



; Publication No. US20030040046A1 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo, E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 
NUMBER OF SEQUENCES: 8 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 
; STREET: 268 BUSH STREET, SUITE 3200 

; CITY: SAN FRANCISCO 

STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94104 
; COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC- DOS/MS-DOS 
SOFTWARE: Patentln Release #1.0, Version #1.30 
; CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
FILING DATE: 20-Aug-2001 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: 08/808,982 
FILING DATE: <Unknown> 
ATTORNEY/ AGENT INFORMATION: 
; NAME: OSMAN, RICHARD A 

REGISTRATION NUMBER: 36,627 
REFERENCE/DOCKET NUMBER: UC96-217 
; TELECOMMUNICATION INFORMATION: 

; TELEPHONE: (415) 343-4341 

TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 
; LENGTH: 557 amino acids 

; TYPE: amino acid 

STRANDEDNESS: No. US20030040046A1 Relevant 
TOPOLOGY: No. US20030040046A1 Relevant 
MOLECULE TYPE: peptide ' 
; SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

US-09-933-261-6 



Query Match 58.8.%; Score 2815.5; DB 10; Length 557; 

Best Local Similarity 96.8%; Pred. No. 7.4e-223; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 

Qy 343 N CT S DLCVH S AS G P EDVAL YVGL I AVAVCLVLLL LVL I LVYC RKKE GL D S DVAD S S I LT S 402 

I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I 
Db 1 N CT S DLXVHT AS G P E DVAL YVGL I AVAVCLVL L L LVL I LVYCRKKE GL D S DVAD S S I LT S 60 



Qy 403 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 



Qy 463 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 522 

I I I I I I I I I I I I I I I I II I I I I I I 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180 

Qy 523 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVTLAMDHCGEP 582 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEP 240 

Qy 583 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 642 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I II I I I I I I I I I I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300 

Qy 643 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFK 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 

Qy 703 DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 762 

I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 361 DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 420 

Qy 763 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 822 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 421 KLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISS 480 

Qy 823 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 882 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I II I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAG 540 

Qy 883 LGQPDAGLFT-VSEAEC 898 

I : I I I I I 
Db 541 TXPAGRWLLSQCSEAEC 557 



RESULT 11 
US-10-256-702-6 

; Sequence 6, Application US/10256702 
; Publication No. US20030059859A1 

GENERAL INFORMATION: 
; APPLICANT: Tessier-Lavigne, Marc 

; Leonardo , E. David 

; Hink, Lindsay 

; Masu, Masayuki 

; Kazuko, Keino-Masu 

TITLE OF INVENTION: Netrin Receptors 

NUMBER OF SEQUENCES: 8 
; CORRESPONDENCE ADDRESS: .. . 

; ADDRESSEE: SCIENCE & TECHNOLOGY LAW GROUP 

STREET: 268 BUSH STREET, SUITE 3200 

CITY: SAN FRANCISCO 

STATE: CALIFORNIA 
; COUNTRY: USA 

ZIP: 94104 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 

COMPUTER: IBM PC compatible 
; OPERATING SYSTEM: PC-DOS/MS-DOS 

; SOFTWARE: Patentln Release #1.0, Version #1.30 



CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/10/256, 702 
FILING DATE: 27-Sep-2002 
CLASSIFICATION: <Unknown> 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/09/933,261 
FILING DATE: 20-Aug-2001 
APPLICATION NUMBER: 08/808,982 
FILING DATE: <Unknown> 
ATTORNEY/ AGENT INFORMATION: 
NAME: OSMAN, RICHARD A 
REGISTRATION NUMBER: 36,627 
REFERENCE/ DOCKET NUMBER: UC96-217 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: (415) 343-4341 
TELEFAX: (415) 343-4342 
INFORMATION FOR SEQ ID NO: 6: 
SEQUENCE CHARACTERISTICS: 

LENGTH: 557 amino acids 
TYPE: amino acid 

STRANDEDNESS: No. US20030059859A1 Relevant 
TOPOLOGY: No. US20030059859A1 Relevant 
MOLECULE TYPE: peptide 
SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
US-10-256-702-6 

Query Match 58.8%; Score 2815.5; DB 14; Length 557; 

Best Local Similarity 96.8%; Pred. No. 7.4e-223; 

Matches 539; Conservative 2; Mismatches 15; Indels 1; Gaps 1; 

Qy 343 NCTSDLCVHSASGPEDVALYVGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTS 402 

I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 NCT S DLXVHTAS G P E DVAL YVGL I AVAVC LVLL L LVL I LVYC RKKEGL D S DVAD S S I LT S 60 

Qy 403 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 462 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 61 GFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGG 120 

Qy 463 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 522 

I I I I I I I I I I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 RHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIP 180 

Qy 523 PDAIPRGKIYEIYLTLHKPEDVT^LPLAGCQTLLSPIVSCGPPGVLLTRPVILiAMDHCGEP 582 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I I i I I 

Db 181 PDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEP 240 

Qy 583 SPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 642 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 
Db 241 SPDSWSLALKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 300 

Qy 643 V7W^KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEVVQLEKQLGGQLIQEPRVLHFK 702 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I 
Db 301 VAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHLX 360 



Qy 

Db 



703 
361 



DSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 
I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
DSYHNLXLSXHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLAC 



762 
420 



Qy 

Db 



763 
421 



KLWVWQVEGDGQS FSINFNITKDT RFAELLALES EAGVPALVGPSAFKI PFLIRQKIISS 822 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
KLWVWQVEGDGQS FS I N FN I TKDT RFAELLALES EAGVPALVGPSAFKI PFLI RQKI I S S 480 



Qy 823 LDP PCRRGADWRTLAQKLHLDSHLS FFAS KPS PTAMI LNLWEARHFPNGNLSQLAAAVAG 882 

I I i I I I I I I I II I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 481 LDPPCRRGADWRTLAQKLHLDSHLS FFAS KPS PTAMI LNLWEIARHFPNGNLSQLAAAVAG 540 

Qy 883 LGQPDAGLFT-VSEAEC 898 

I : I I I I I 
Db 541 TXPAGRWLLSQCSEAEC 557 



RESULT 12 
US-09-970-944-15 

; Sequence 15, Application US/09970944 

; Publication No. US20030204052A1 

; GENERAL INFORMATION: 

; APPLICANT: Herrman, John L 

; APPLICANT: Rastelli, Luca 

; APPLICANT: Shimkets, Richard A 

; TITLE OF INVENTION: No. US20030204052Alel Proteins and Nucleic Acids Encoding 
Same and 

; TITLE OF INVENTION: Antibodies Directed Against these Proteins 
; FILE REFERENCE: 21402-138 

; CURRENT APPLICATION NUMBER: US/09/970,944 

; CURRENT FILING DATE: 2002-05-02 

; PRIOR APPLICATION NUMBER: 60/237,862 

; PRIOR FILING DATE: 2000-10-04 

; NUMBER OF SEQ ID NOS : 62 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 15 

LENGTH: 931 

TYPE: PRT 
; ORGANISM: Caenorhabditis elegans 
US-09-970-944-15 



Query Match 58.2%; Score 2787; DB 10; Length 931; 

Best Local Similarity 57.3%; Pred. No. 3.5e-220; 

Matches 522; Conservative 153; Mismatches 208; Indels 28; Gaps 9; 



Qy 


9 


PALLGIVLAAWLRGS GAQQSA TVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVC 


65 






III :|:| II II : 1 1 : 1 1 1 1 1 : 1 1 1 : I I 1 II I I I I 1 




Db 


26 


PAL— ALLSASGTGSAAQDDEFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 


83 


Qy 


66 


KAVPATQIFFKCNGEWWQVTDHVIERSTDGSSGLPTMEVI^INVSRQQV^KVFGLEEYWCQ 


125 






II 1 1 1 1 1 : 1 1 1 1 III 1 III:: 1 : 1 1 1 II 1 : 1 1 1 1 1 1 :: 1 1 1 : 1 1 1 1 




Db 


84 


KAS PATQI YFKCNSEWVIiQKDHVVDERVDETSGLI VREVS I EI SRQQVEELFGPEDYWCQ 


143 


Qy 


126 


CVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVTjPCRPPEGIPPAEVEWLR 


185 






1 1 1 1 1 1 : 1 1 1 1 1 : 1 1 1 : 1 1 1 III llllll lllllll ::| MINIM MINI: 




Db 


144 


CVAWSSAGTTKSRKAYVRIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLK 


203 


Qy 


186 


N E DLVD P S LD PNVY I T REH S LWRQARLADTAN YT C VAKN I VARRRS AS AAVT VYVN GGW 


245 






1 1 1 : : 1 1 : Mill : 1 : 1 : : : 1 1 II : 1 1 1 1 1 1 II 1 1 II 1 1 : 1 : 1 :| MUM Ml 




Db 


204 


NEDIIDPAEDRNFYITIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGW 


263 



Qy 246 STWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 

Mlllllll:: I I I I : I I I : I : I I I I I I I I I 1 I I I I I I : I I I II I I I I E I |: || 
Db 264 STWTEWSVCNSRCGRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

III II : I I I I I III: II I : I I I : : I I I : : I I I I I : : I : I I I I I I I : 
Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQAAPDSDDVALYVGI 383 

Qy 366 -IAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQP 424 

I I I I I I : : : I : I I : : I I : I I I I I I I I I : I I : : I I I : I 
Db 384 VI AVTVC LAI T VWAL FVYRKNHRD FE S D 1 1 DS S ALN GGFQ PVN I KAARQ D LLAVPP 440 

Qy 425 DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVS 480 

II:: I : I : I II : I I : I I I : : : : I I I : III 

Db 441 DLTSAAAMYRGPVYALHD-VSDKIPMTNSPILDPLPNLKIKVYNSSGAVTPQDDLAEFSS 499 

Qy 481 RLS TQNYF RSLPRGT — SNMT YGTFNFLGGRLMI PNTGI SLLI PPDAI 526 

: I I II: : I I I I I : I I I I I I I I : I I I : I : II I I I II 

Db 500 KLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAI 559 

Qy 527 PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDS 586 

I : I : : I I : I : I : I : I : : I I : I I I I : I : I I I I I I I I I I I I I I I : II : I I : 
Db 560 PQGRVYEMYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVILTLHHCADPSTED 619 

Qy 587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 

I : : I I I : : I I I I I : : I I I : I I I : I I I : : I I I : I I I I : : : I I I 
Db 620 WKI QLKNQAVQGQWE DWWGEEN FTT P C YI QLDAEACH I LT EN L ST YAL VGQ S TT KAAA 679 

Qy 647 KRLKLLLFAPVACTS LEYNI RVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDS YH 706 

I I I I I : I I : 1:1111:111111 II I I I I I I : I I I : I : I I I I : : I I : I I I I I I 
Db 680 KRLKLAI FGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGSIH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 

I I I I I I I I : I I I I I I I I I I II I I I I I I : I : I I I I I I I I I I I : I : I : I I I I I 
Db 740 NLRLSIHDIAHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLCV 799 

Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

1111:11 I :| :::: :| |: : : : Mill II INI: I I I I I 
Db 800 RQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSLDAP 859 

Qy 827 CRRGADWRTIAQKLHLDSHLSFFASKPSPTAMILNLWE7VRHFPNGNLSQLAAAVAGLGQP 886 

II III II I I : I I : I : : I I : I III : I I : I I I I : : I I : I I I I III : : I : 
Db 860 QTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVTLDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSEAE 897 

• l 

Db 920 ETWS LAAEGQ 930 



RESULT 13 
US-09-972-211-121 

; Sequence 121, Application US/09972211 

; Publication No. US20040048245A1 

; GENERAL INFORMATION: 

; APPLICANT: Shimkets, Richard A 

; APPLICANT: Taupier Jr, Raymond J 



APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 
APPLICANT 



Burgess, Catherine E 
Zerhusen, Bryan D 
Mezes, Peter S 
Rastelli, Luca 
Malyankar, Uriel M 
Grosse, William M 
Alsobrook II, John P 
Lepley, Denise M 
Spytek, Kimberly Ann 
Li , Li 

Edinger, Shlomit 
Gerlach, Valerie 
Ellerman, Karen 
MacDougall, John R 
Gunther, Erik 
Millet, Isabelle 
Stone, David J 
Smithson, Glennda 
Szekeres Jr, Edward S 

No. US20040048245Alel Human Proteins, Polynucleotides 



APPLICANT 
TITLE OF INVENTION 
Encoding Them And 

TITLE OF INVENTION: Methods Of Using The Same 
FILE REFERENCE: 21402-141 

CURRENT APPLICATION NUMBER: US/09/972,211 
CURRENT FILING DATE: 2001-10-05 
PRIOR APPLICATION NUMBER: 60/238,325 
PRIOR FILING DATE: 2000-10-05 
PRIOR APPLICATION NUMBER: 60/238,323 
PRIOR FILING DATE: 2000-10-05 
PRIOR APPLICATION NUMBER: 60/238,400 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,397 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,401 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,379 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,402 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 30/238,384 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,373 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,372 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,383 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/238,382 
PRIOR FILING DATE: 2000-10-06 
PRIOR APPLICATION NUMBER: 60/275,892 
PRIOR FILING DATE: 2001-03-14 
PRIOR APPLICATION NUMBER: 60/296,860 
PRIOR FILING DATE: 2001-06-08 
NUMBER OF SEQ ID NOS : 198 
SOFTWARE: Patentln Ver. 2.1 
SEQ ID NO 121 
LENGTH: 931 



TYPE: PRT 
; ORGANISM: Mus mus cuius 
US-09-972-211-121 



Query Match 58.2%; Score 2787; DB 11; Length 931; 

Best Local Similarity 57.3%; Pred. No. 3.5e-220; 

Matches 522; Conservative 153; Mismatches 208; Indels 28; Gaps 9; 

Qy 9 PALLGIVLAAWLRGSGAQQSA TVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVC 65 

III : I : I M I I : I I : I I I i I : I I I : I I I I I I I I I I 

Db 26 PAL — ALLSASGTGSAAQDDEFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 83 

Qy 66 KAVPATQIFFKCNGEWVT*QVT)HVIERST 125 

II I I I I I : I I I I III I III:: I : I I I II I : I I I I I I : : I I I : I I I I 
Db 84 KAS PATQI YFKCNS EWVHQKDHWDERVDET SGLI VREVS I EI S RQQVEELFGPEDYWCQ 143 

Qy 126 CVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLR 185 

I I I I I I : I I I I I : I I I : I I I III I I I I I I I I I I I I I :: I I I I I I I I I I I I I I I : 
Db 144 CVAWS SAGTTKS RKAYVRIAYLRKT FEQEPLGKEVSLEQEVLLQCRP PEGI PVAEVEWLK 203 

Qy 186 NEDL VT)PS LDPNVAf I TREHS L VVRQARLADTAN 245 

I I I :: I I : I I III : I : I : : : I I I I : I I I I I I I I I I I I I I : I : I : I I I I I I I I I I 
Db 204 N ED 1 1 D PAE DRN FY I T I DHNLI I KQARL S DTAN YT CVAKN I VAKRKS TTATVT VYVNGGW 263 

Qy 246 STWT EWS VC S AS CGRGWQKRS RS CTN PAPLNGGAFCEGQNVQKTACATLC P VDGS WS PWS 305 

I I I I I I I I I : : I I I 1 : I I I : I : I I I I I I I I I I I I I I I I : I I I II I I I I I I I I : II 
Db 264 STWTEWSVCNSRCGRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

I I I I I : I I I I I I I I : I I I : I I I :: I I I :: I I I I I : : I : I I I I I I I : 
Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQAAPDSDDVALYVGI 383 

Qy 366 -IAVAVCLVLLLLVXILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQP 424 

I I I I I I : : : I : I I : : I I : I I I I I I I I I : I I : : I I I : I 
Db 384 VIAVTVCLAITVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAARQD LLAVPP 440 

Qy 425 DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVS 480 

II:: 1:1 : I II :|| :| II : :::|| I : III 
Db 441 DLTSAAAMYRGPVYALHD-VSDKI PMTNS P I LDPLPNLKI KVYNS SGAVT PQDDLAEFS S 499 

Qy 481 RLS TQNYF RSLPRGT— SNMTYGTFNFLGGRLMIPNTGISLLIPPDAI 526 

: I I II: : I I I I I : I I I I I I I I : I I I : I : I I I I I II 

Db 500 KLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAI 559 

Qy 527 PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDS 586 

I : I : : II : I : I : I : I : : I I : I I I I : I : I I I I I I I I I I I II I I : II : I I : 
Db 560 PQGRVYEMYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVILTLHHCADPSTED 619 

Qy 587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 

I : : I I I : : I II I I : : I I I : I I I : I I I : : I I I : I I M : : : III 
Db 620 WKIQLKNQAVQGQWEDVWVGEENFTTPCYIQLDAEACHILTENLSTYALVGQSTTKAAA 679 

Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 706 

I I I I I : I I : I : I I I I : I I I I I I II I I I I I I : I I I : I : I I I I : : I I : I I I I I I 
Db 680 KRLKLAI FGPLCCS SLEYS I RVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGS IH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 



I II I I I I I : I I I I I I I I I I I I I I I I I I : I : II I I I I I I I I I : I :| : I III I 
Db 740 NLRLSIHDIAHSLWKSKLLAKYQE 799 

Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

I I I I : I I I : I : : : : : I I : : : : I I I I I I I I I I I : I I I I I 
Db 800 RQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSLDAP 859 

Qy 827 CRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQP 886 

II III II II : I I : I : : II : I I II : I I : I I I I : : I I : I I I I III : : I : 
Db 860 QTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSEAE 897 

Db 920 ETWSLAAEGQ 930 



RESULT 14 
US-10-087-684-35 

; Sequence 35, Application US/10087684 
; Publication No. US20040029116A1 
; GENERAL INFORMATION: 



APPLICANT 


: Edinger, 


Shlomit R. 


APPLICANT 


: MacDougall, John R. 


APPLICANT 


: Millet, 


Isabelle 


APPLICANT 


: Ellerman, Karen 


APPLICANT 


: Stone, 


David J. 


APPLICANT 


: Grosse, 


William M. 


APPLICANT 


: Lepley, 


Denise M. 


APPLICANT 


: Rieger, 


Daniel K. 


APPLICANT 


: Burgess 


, Cathereine E. 


APPLICANT 


: Casman, 


Stacie, J. 


APPLICANT 


Spytek, 


Kimberly A. 


APPLICANT 


Boldog, 


Ferenc L. 


APPLICANT 


Li, Li 




APPLICANT 


Padigaru, Muralidhara 


APPLICANT 


Mishra, 


Vishnu 


APPLICANT 


Shenoy, 


Suresh G. 


APPLICANT 


Rastell 


i, Luca 


APPLICANT 


Tchernev, Velizar T. 


APPLICANT 


Vernet, 


Corine A.M. 


APPLICANT 


Zerhusen, Bryan D. 


APPLICANT 


Malyank 


ar, Uriel M. 


APPLICANT 


Guo, Xiaojia 


APPLICANT 


Miller, 


Charles E. 


APPLICANT 


Gangoll 


i, Esha A. 



TITLE OF INVENTION: PROTEINS AND NUCLEIC ACIDS ENCODING SAME 

FILE REFERENCE: 21402-214 CIP 

CURRENT APPLICATION NUMBER: US/10/087,684 

CURRENT FILING DATE: 2003-03-10 

PRIOR APPLICATION NUMBER: 60/253,834 

PRIOR FILING DATE: 2000-11-29 

PRIOR APPLICATION NUMBER: 60/250,926 

PRIOR FILING DATE: 2000-11-30 

PRIOR APPLICATION NUMBER: 60/264,180 

PRIOR FILING DATE: 2001-01-25 

PRIOR APPLICATION NUMBER: 60/274,194 

PRIOR FILING DATE: 2001-03-08 



PRIOR APPLICATION NUMBER: 60/313,656 
PRIOR FILING DATE: 2001-08-20 
PRIOR APPLICATION NUMBER: 60/327,456 
PRIOR FILING DATE: 2001-10-05 
NUMBER OF SEQ ID NOS : 220 
SOFTWARE: CuraSeqList version 0.1 
SEQ ID NO 35 
LENGTH: 931 
TYPE: PRT 

ORGANISM: Mus mus cuius 
US-10-087-684-35 

Query Match 58.2%; Score 2787; DB 15; Length 931; 

Best Local Similarity 57.3%; Pred. No. 3.5e-220; 

Matches 522; Conservative 153; Mismatches 208; Indels 28; Gaps 9; 

Qy 9 PALLGIVLAAWLRGSGAQQSA TVAN PVP GAN P DL L P H FLVE P EDVYI VKNK PVLLVC 65 

III : I : I I I I I : I I : I I I I I : I I I : I I I I I I I I I I 

Db 26 PAL — ALLSASGTGSAAQDDEFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 83 

Qy 66 KAVPATQIFFKCNGEWVT*QVI)HVIERSTDGSSGLPTO 125 

II I I I I I : I I I I I I I I III:: I : I I I II I : I I I I I I :: I I I : I I I I 
Db 84 KASPATQIYFKCNSEWVlIQKDHVVDERvTDETSGLIWEVSIEISRQQV^ELFGPEDYWCQ 143 

Qy 126 CVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLR 185 

I I I I I I : I I I II : I I I : I I I III I I I I I I I I I I I I I : : I I I I II I II MINI: 
Db 144 CVAWSSAGTTKSRKAYVRIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGI PVAEVEWLK 203 

Qy 186 NEDLVT)PSLDPNVYITREHSLVVRQARLAOT 245 

I I I : : I I : I I III : I : I : : : I I I I : I I I I I II I I I I I I I : I : I : I I I I I I I I I I 
Db 204 NEDIIDPAEDRNFYITIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGW 263 

Qy 246 STWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 

I I I I I I I I I : : I I I I : I I I : I : I I I I I I I I I I I I I I I I : I I I II I I I I I I I I : II. 
Db 264 STWTEWSVCNSRCGRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

I I I I I : I I I I I I I I : I I I : I I I :: I I I : : I I I I I : : I : I I I I I I I : 
Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQAAPDSDDVALYVGI 383 

Qy 366 -IAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQP 424 

I I I I I I : : : I : I I : : I I : I I I I I I I I I : I I : : I I I : I 
Db 384 VIAVTVCLAITVWALFVTRKNHRDFESDIIDSSALNGGFQPVNIKAARQD LLAVPP 440 

Qy 425 DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVS 480 

II:: I : I : I II : I I : I I I : : : . : I I I : III 
Db 441 DLTSAAAMYRGPvTALHD-VSDKIPMTNSPILDPLPNLKIKVTNSSGAVTPQDDLAEFSS 499 

Qy 481 RLS TQNYF RSLPRGT — SNMT YGTFNFLGGRLMI PNTGI SLLI P PDAI 526 

: I I II: : I I I I I : I I I I I I I I : I I I : I : I I I I I II 

Db 500 KLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAI 559 

Qy 527 PRGKIYEIYLTLHKPEDWLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDS 586 

I : I : : I I : I : I : I : I :: I I : I I I I : I : I I I I I I I I I I I I I I I : II : I I : 
Db 560 PQGRVYEMYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVILTLHHCADPSTED 619 



QY 



587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 



I ::|| I: :| MM: :IM : I MM II— M I Mill:: : Ml 
Db 620 WKI QLKNQAVQGQWEDWWGEEN FTT P C Y I QL DAEACH I LT EN LS T YALVGQS TT KAAA 67.9 

Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 706 

M M I M I : M M M M M M I M I M I M M M M M M MM M MM I I 
Db 680 KRLKLAI FGPLCCS S LEYS I RVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGS IH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 

M M M M : M M M M M M M M M M M I I I I I I I I I I M M : I I M I 

Db 740 NLRLSIHDIAHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLCV 799 

Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

I M M M I M : : : : M M : : : I I I I I I I I I I I : I I I I I 
Db 800 RQVEGEGQI FQLNCTVS EEPTGI DLPLLDPAST I TTVTGP SAFS I PLP I RQKLCS SLDAP 859 

Qy 827 CRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQP 886 

M Ml II II M I : I : : M : I Ml M I M II I : M I : I M I III : M : 
Db 860 QTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSEAE 897 

Db 920 ETWS LAAEGQ 930 



RESULT 15 
US-10-037-417-117 

Sequence 117, Application US/10037417 
Publication No. US20040052806A1 
GENERAL INFORMATION: 
APPLICANT: Kekuda, Ramesh 
APPLICANT: Alsobrook II, John P 
APPLICANT: Tchernev, Velizar T 
APPLICANT: Liu, Xiaohong 
APPLICANT: Spytek, Kimberly A 
APPLICANT: Patturajan, Meera 
APPLICANT: Grosse, William M 
APPLICANT: Lepley, Denise M 
APPLICANT: Burgess, Catherine E 
APPLICANT: Vernet, Corine A.M. 
APPLICANT: Li, Li 
APPLICANT: Gorman, Linda 
APPLICANT: Edinger, Shlomit R 
APPLICANT: Sciore, Paul 
APPLICANT: Ellerman, Karen 
APPLICANT: Malyankar, Uriel M 
APPLICANT: Rothenberg, Mark 
APPLICANT: Stone, David J 
APPLICANT: Boldog, Ferenc L 
APPLICANT: Guo, Xiaojia 
APPLICANT: Shenoy, Suresh G 
APPLICANT: Anderson, David W 
APPLICANT: Padigaru, Muralidhara 
APPLICANT: Taupier Jr, Raymond J 
APPLICANT: Miller, Charles E 
APPLICANT: Eisen, Andrew J 

TITLE OF INVENTION: Proteins and Nucleic Acids Encoding Same 
FILE REFERENCE: 21402-235 



; CURRENT APPLICATION NUMBER: US/10/037,417 

; CURRENT FILING DATE: 2002-09-20. 

; PRIOR APPLICATION NUMBER: 60/260,018 

; PRIOR FILING DATE: 2001-01-05 

; PRIOR APPLICATION NUMBER: 60/260,360 

; PRIOR FILING DATE: 2001-01-08 

; PRIOR APPLICATION NUMBER: 60/272,411 

; PRIOR FILING DATE: 2001-02-28 

; PRIOR APPLICATION NUMBER: 60/272,817 

; PRIOR FILING DATE: 2001-03-02 

; PRIOR APPLICATION NUMBER: 60/291,186 

; PRIOR FILING DATE: 2001-05-15 

; PRIOR APPLICATION NUMBER: 60/303,231 

; PRIOR FILING DATE: 2001-07-05 

; PRIOR APPLICATION NUMBER: 60/305,060 

; PRIOR FILING DATE: 2001-07-12 

; PRIOR APPLICATION NUMBER: 60/318,405 

; PRIOR FILING DATE: 2001-09-10 

; PRIOR APPLICATION NUMBER: 60/318,700 

; PRIOR FILING DATE: 2001-09-12 

; NUMBER OF SEQ ID NOS : 227 

; SOFTWARE: Patentln Ver. 2.1 

; SEQ ID NO 117 

LENGTH: 931 

TYPE : PRT 
; ORGANISM: Caenorhabditis elegans 
US-10-037-417-117 

Query Match 58.2%; Score 2787; DB 15; Length 931; 

Best Local Similarity 57.3%; Pred. No. 3.5e-220; 

Matches 522; Conservative 153; Mismatches 208; Indels 28; Gaps 9; 

Qy 9 PALLGIVLAAWLRGSGAQQSA TVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVC 65 

III : I : I I I I I : I I : I I I I I : I I I : I I I I I I I I I I 

Db 26 PAL — ALLSASGTGSAAQDDEFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 83 

Qy 66 KAWATQIFFKCNGEWVT^QVTDHVIERSTDGSSGL 125 

II 11111:1111 M I I III:: I : I I I II I : I I I I I I : : I I I : I I I I 
Db 84 KASPATQIYFKCNSEWVTIQKDHWDERvI)ETSGLIVT^EVSIEISRQQv^ELFGPEDYWCQ 143 

Qy 126 CVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLR 185 

I I I I I I : I I I I I : I I I : I I I III I I I I I I I I I I I I I : : I I I I I I I I I MINI: 
Db 144 CVAWS SAGTTKS RKAYVRIAYLRKT FEQEPLGKEVS LEQEVLLQCRP PEGI PVAEVEWLK 203 

Qy 186 NEDLVTDPSLDPNWITREHSLVVRQARLADTANYTCVAKNIVARRR 245 

I I I : : I I : I I I I I : I : I : : : I I I I : I I I I I I I I I I I I I I : I : I : I I I I I I I I I I 
Db 204 NEDIIDPAEDRNFYITIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGW 263 

Qy 246 STWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 

I I I I I I I I I : : I II I : I I I : I : II I II I I I I I I I I I I I : I I I II II I II I I I : II 
Db 264 STWTEWSVCNSRCGRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

I I I I I : I I I I I I II : I I I : I I I :: I I I :: I I I I I : : I : I I I I I I I : 
Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQAAPDSDDVALYVGI 383 



Qy 



366 - IAVAVCLVLLLLVLI LVYCRKKEGLDSDYADS S I LTSGFQPVS I KPSKADNPHLLTIQP 424 



Db 



I I I I I I : : : I : I I : : I I : I I I I I I I I I : I I : : I I I : I 
384 VIAVWCIAITVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAARQD LLAVPP 440 



Qy 425 DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVS 480 

II:: I : I : I II : I I : I I I : : : : I I I : Ml 

Db 441 DLTSAAAMYRGPWALHD-VSDKIPMTNSPILDPLPNLKIKVYNSSGAVTPQDDLAEFSS 499 

Qy 481 RLS— TQNYF RSLPRGT — SNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI 526 

•II II: : I I I I I : I I I I I I I I : I I I : I : I I I I I II 

Db 500 KLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAI 559 

Qy 527 PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDS 586 

I : I : : I I : I : I : I : | : : | | : I I I I : I : I I I I I I I I I I II II I : II : I I : 
Db 560 PQGRVYE3VIYVTVHRKENMRPPMEDSQTLLTPVVSCGPPGALLTRPVILTLHHCADPSTED 619 

Qy 587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 

I : : I I I : : I I I I I : : I I I : I I I : I I I :: I I I : I I I I :: : I I I 
Db 620 WKIQLKNQAVQGQWEDWWGEENFTTPCYIQLDAEACHILTENLSTYALVGQSTTKAAA 679 

Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 706 

I I I II : I I : I : I I I I : I I I I I I II I I I I I I : I I I : I : I I I I : : I ! : I I I I I I 
Db 680 KRLKLAIFGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGSIH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 

I I I I I I I I : I I I I I I I I I I I II I I I I I : I : I I I I I I I I I I I : I : I : I I I I I 
Db 740 NLRLSIHDIAHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLCV 799 

Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

I I I I : I I I : I : :: : : I I : : : : I I I I I I I I I I I : I I I I I 
Db 800 RQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSLDAP 859 

Qy 827 CRRGADWRTLAQKLHLDSHLSFFA3KPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQP 886 

II Ml II I I : I I : I : : I I : I III : I I : I I I I : : I I : I I I I III : : I : 
Db 860 QTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVILDLWE1AQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVS EAE 897 

• • • i • 

• • • I • 

Db 920 ETWS LAAEGQ 930 



Search completed: March 1, 2005, 09:51:25 
Job time : 176.833 sees 



GenCore version 5.1.6 
Copyright (c) 1993 - 2005 Compugen Ltd. 



OM protein - protein search, using sw model 



Run on: 



March 1, 2005, 08:41:47 ; Search time 164.273 Seconds 

(without alignments) 
2799.282 Million cell updates/sec 



Title: 

Perfect score: 
Sequence : 



US-10-624-932-2 
4791 

1 MAVRPGLWPALLGIVLAAWL AVAGL GQ P DAGLFTVS EAEC 898 



Scoring table: 



BLOSUM62 

Gapop 10.0 , Gapext 0.5 



Searched: 



1612378 seqs, 512079187 residues 



Total number of hits satisfying chosen parameters: 1612378 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 

Database : UniProt_03 : * 

1: uniprot__sprot : * 
2 : uniprot_trembl : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 

% 

Result Query 

No. Score Match Length DB ID Description 



1 


4685 


97. 


8 


898 
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RESULT 1 
UN5A_M0USE 

ID UN5A_M0USE STANDARD; PRT; 898 AA. 

AC Q8K1S4; Q6PEF7; Q80T71; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1) . 

GN Name=Unc5a; Synonyms=Kiaal976, Unc5hl; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

,0C Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX MEDLINE=22239710; PubMed=12351186; DOI=10 . 1016/S0925-4773 ( 02 ) 00248-4 ; 

RA Engelkamp D.; 

RT "Cloning of three mouse unc-5 genes and their expression patterns at 

RT mid-gestation . " ; 

RL Mech. Dev. 118:191-197(2002). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 3). 



RC TISSUE=Brain; 

RX MEDLINE=22579291; PubMed=12693553 ; 

RA Okazaki N., Kikuno R. , Ohara R. , Inamoto S., Aizawa H., Yuasa S., 

RA Nakajima D., Nagase T., Ohara 0., Koga H.; 

RT "Prediction of the coding sequences of mouse homologues of KIAA gene: 

RT II. The complete nucleotide sequences of 400 mouse KIAA-homologous 

RT cDNAs identified by screening of terminal sequences of cDNA clones 

RT randomly sampled from size-fractionated libraries."; 

RL DNA Res. 10:35-48(2003). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899 ; 

RA Strausberg R.L., Feingold E. A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T. f Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M., Soares M. B. , Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA . Raha S.S., Loquellano N.A. , Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA Villalon D. K. , Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J. , Helton E. , Ketteman M. , Madan A. , Rodrigues S., Sanchez A. , 

RA Whiting M., Madan A. , Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N., Krzywinski M.I., Skalska U., Smailus D . E. , 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

CC FUNCTION: Receptor for netrin required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand (By 

CC similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with MAGED1. Interacts with PRKCABP, possibly mediating some 

CC interaction with PKC (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction 
CC with PRKCABP regulates its surface expression and leads to its 

CC removal from surface of neurons and growth cones (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event =Alte ma tive splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q8KlS4-l ; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q8KlS4-2; Sequence=VSP_011697 ; 

CC Note=No experimental confirmation available; 

CC Name=3; 

CC IsoId=Q8KlS4-3; Sequence=VSP_011696; 

CC Note=No experimental confirmation available; 

CC -!- TISSUE SPECIFICITY: Restricted to central nervous system. 
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DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which 
participates in the induction of apoptosis (By similarity) . 
PTM: Phosphorylated by PKC in vitro . Phosphorylated on cytoplasmic 
tyrosine residues (By similarity) . 

PTM: Proteolytically cleaved by caspases during apoptosis. The 
cleavage does not take place when the receptor is associated with 
netrin ligand. Its cleavage by caspases is required to induce 
apoptosis (By similarity) . 
SIMILARITY: Belongs to the UNC-5 family. 
SIMILARITY: Contains 1 death domain. 

SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 
SIMILARITY: Contains 1 immunoglobulin-like domain. 
SIMILARITY: Contains 2 TSP type-1 domains. 
SIMILARITY: Contains 1 ZU5 domain. 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; AJ48J852; CAD32250.1; -. 

EMBL; AK122575; BAC65857.1; ALT_INIT. 

EMBL; BC058084; AAH58084.1; -. 

HSSP; P07996; 1LSL. 

MGD; MGI:894682; UncSa. 

InterPro; IPR000488; Death. 

InterPro; IPR011029; DEATH_like. 

InterPro; IPR003599; Ig. 

InterPro; IPR007110; Ig-like. 

InterPro; IPR000884; TSP1. 

InterPro; IPR008085; TSP_1. 

InterPro; IPR000906; ZU5. 

Pfam; PF00531; Death; 1. 

Pfam; PF00047; ig; 1. 

Pfam; PF00090; TSP_1; 2. 

Pfam; PF00791; ZU5; 1. 

PRINTS; PR01705; TSP1REPEAT. 

SMART; SM00005; DEATH; 1. 

SMART; SM00409; IG; 1. 

SMART; SM00209; TSP1; 2. 

SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; D EAT H__DOMAI N ; FALSE_NEG. 
PROSITE; PS50835; IGJLIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 
Transmembrane. 
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Query Match 97.8%; Score 4685; DB 1; Length 898; 

Best Local Similarity 96.7%; Pred. No. 0; 

Matches 868; Conservative 19; Mismatches 11; Indels 0; Gaps 0; 
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RESULT 2 
UN 5 A RAT 



ID UN5A_RAT STANDARD; PRT; 898 AA. 

AC 008721; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homo log A) (Unc-5 homolog 1) . 

GN Name=Unc5a; Synonyms =Unc5hl ; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A. , FUNCTION, SUBCELLULAR LOCATION, AND TISSUE 

RP SPECIFICITY. 

RC TISSUE=Ventral spinal cord; 

RX MEDLINE=97271897; PubMed=9126742 ; 

RA Leonardo E.D., Hinck L., Masu M. , Keino-Masu K., Ackerman S.L., 

RA Tessier-Lavigne M. ; 

RT "Vertebrate homologues of C. elegans UNC-5 are candidate netrin 

RT receptors."; 

RL Nature 386:833-838 (1997) . 

RN [2] 

RP FUNCTION, AND INTERACTION WITH DCC. 

RX PubMed=10399920; 

RA Hong K. , Hinck L. , Nishiyama M. , Poo M.-M., Tessier-Lavigne M. , 

RA Stein E. ; 

RT "A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion."; 



RL Cell 97:927-941(1999). 

RN [3] 

RP TISSUE SPECIFICITY. 

RX PubMed= 11472849; 

RA Barrett C, Guthrie S.; 

RT "Expression patterns of the netrin receptor UNC5H1 among developing 



RT motor neurons in the embryonic rat hindbrain . " ; 

RL Mech. Dev. 106:163-166(2001). 

RN [4] 

RP FUNCTION. 

RX PubMed=11387206; DOI=10 . 1093/embo j /20 . 11 . 2715 ; 

RA Llambi F., Causeret F. , Bloch-Gallego E., Mehlen P.; 

RT "Netrin- 1 acts as a survival factor via its receptors UNC5H and DCC"; 

RL EMBO J. 20:2715-2722(2001). 

RN [5] 

RP FUNCTION, SUBCELLULAR LOCATION, AND INTERACTION WITH MAGED1 . 

RX PubMed=12598531; DOI=10 . 1074/ jbc.M300415200; 

RA Williams M.E., Strickland P., Watanabe K. , Hinck L. ; 

RT "UNC5H1 induces apoptosis via its juxtamembrane region through an 

RT interaction with NRAGE . " ; 

RL J. Biol. Chem. 278:17483-17490(2003). 

RN [6] 

RP INTERACTION WITH PRKCABP, PHOSPHORYLATION, AND MUTAGENESIS OF 

RP 896-ALA— CYS-898. 

RX PubMed=14672991; DOI=23/36/11279; 

RA Williams M.E., Wu S.C.-Y., McKenna W.L., Hinck L.; 

RT "Surface expression of the netrin receptor UNC5H1 is regulated through 

RT a protein kinase C-interacting protein/protein kinase-dependent 

RT mechanism. " ; 

RL J. Neurosci. 23:11279-11288(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with MAGED1. Interacts with PRKCABP, possibly mediating some 

CC interaction with PKC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction 
CC with PRKCABP regulates its surface expression and leads to its 

CC removal from surface of neurons and growth cones. 

CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 
CC neurons. Expressed at early stages of neural tube development in 

CC the ventral spinal cord. In developing hindbrain, it colocalizes 

CC with a number of cranial motor neuron subpopulations from 

CC embryonic Ell to E14, while DCC is expressed by motor neurons at 

CC E12. Also expressed in non-neural structures, such as the basal 

CC plane of the hindbrain and midbrain, in the developing 

CC hypothalamus, thalamus and in the pallidium. 

CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which 

CC participates in the induction of apoptosis. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . Phosphorylated by PKC in vitro. 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 
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apoptosis . 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 

SIMILARITY: 



Belongs to the UNC-5 family. 
Contains 1 death domain. 

Contains 1 immunoglobulin-like C2-type domain. 
Contains 1 immunoglobulin-like domain. 
Contains 2 TSP type-1 domains. 
Contains 1 ZU5 domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; U87305; AAB57678.1; -. 
HSSP; P07996; 1LSL. 
RGD; 621755; Unc5hl. 
InterPro; IPR000488; Death. 
InterPro; IPR011029; DEATH_like. 
InterPro; IPR003599; Ig. 
InterPro; IPR007110; Ig-like. 
InterPro; IPR000884; TSP1. 
InterPro; IPR008085; TSP_1. 
InterPro; IPR000906; ZU5. 
Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00409; IG; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; DEATH__DOMAIN ; FALSE_NEG. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Apoptosis; Developmental protein; Immunoglobulin domain; 
Phosphorylation; Receptor; Repeat; Signal; Transmembrane. 



FT 


SIGNAL 


1 


25 


Potential . 


FT 


CHAIN 


26 


898 


Netrin receptor UNC5A. 


FT 


DOMAIN 


26 


361 


Extracellular (Potential) . 


FT 


TRANSMEM 


362 


382 


Potential. 


FT 


DOMAIN 


383 


898 


Cytoplasmic (Potential) . 


FT 


. DOMAIN 


44 


141 


Ig-like. 


FT 


DOMAIN 


155 


238 


Ig-like C2-type. 


FT 


DOMAIN 


242 


296 


TSP type-1 1. 


FT 


DOMAIN 


298 


350 


TSP type-1 2. 


FT 


DOMAIN 


495 


598 


ZU5. 


FT 


DOMAIN 


817 


897 


Death. 


FT 


SITE 


396 


397 


Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


661 


679 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


65 


124 


By similarity. 


FT 


DISULFID 


170 


221 


By similarity. 


FT 


CARBOHYD 


107 


.107 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


218 


218 


N-linked (GlcNAc. . .) (Potential). 



FT CARBOHYD 343 343 N-linked (GlcNAc. . .) (Potential). 

FT MUTAGEN 896 898 Missing: Abolishes interaction with 

FT PRKCABP . 

SQ SEQUENCE 898 AA; 98840 MW; 7 A3 CBCB 9 E 7 ACA1 3 5 CRC64; 

Query Match 96.8%; Score 4638; DB 1; Length 898; 

Best Local Similarity 96.0%; Pred. No. 0; 

Matches 862; Conservative 17; Mismatches 19; Indels 0; Gaps 0; 

Qy 1 MAVT^PGLWPALLGIVIiAAWLRGSGAQQSATVANPVPGANPDLLPHFLVE 60 

I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 
Db 1 MAVl^PGLWPVXLGI VXAAWLRGSGAQQSATVANPVPGA^ 60 

Qy 61 VXLVCKAWATQIFFKCNGEWVT*QVT)Hv^ 120 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
Db 61 VXLVCKAWATQIFFKCNGEWV^QVI)HVIERSTDSSSGLPTMEVRINVSRQQVEKVFGLE 120 

Qy 121 EYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

I I I I I M I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 121 EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 

Qy 181 VEWLRN E DLVD P S L D PNVYI T REH S LWRQARLADTAN YT CVAKN I VARRRS AS AAVT VY 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 181 VEWLRNEDLVTDPSLDPNVYITREHSLWRQARIJUDTANYTC^ 240 

Qy 241 WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVXIGS 300 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 241 WGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGS 300 

Qy 301 WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVA 360 

II I I I I I I I I I II I I I I I I II I I I I I I I I I I I I : I I I II MM I I I 1:1:1 I Mill 

Db 301 WSSWSKWSACGLDCTHWRSRECSDPAPRNGGEECRGADLDTRNCTSDLCLHTASCPEDVA 360 

Qy 361 LWGLIAVAVCLVLLLLVXILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 420 

M : I I : I I II I I I I I I I I : I I I M I I I I I I II II II I I I I I II I II I I I II I I I I II 
Db 361 LYIGLVAVAVCLFLLLLALGLIYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLL 420 

Qy 421 TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 480 

I I M I II I II I I II I II I I II II I I I I II : I I I I I I I I I I II I I I II I M I II I : I II 
Db 421 TIQPDLSTTTTTYQGSLCSRQDGPSPKFQLSNGHLLSPLGSGRHTLHHSSPTSEAEDFVS 480 

Qy 481 RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHK 540 

I M II I I II I M II I I I I I I I I I II I I I I I I I I II || II || I I I I I II I II I II I I I I I 
Db 481 RLSTQNYFRS LP RGTSNMAYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHK 540 

Qy 541 PEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

I I II I I II I II II I II I M I I II I II I M I I I II II M I I I I II I II II II II I I I II II 
Db 541 PEDVRLPLAGCQTLLSPWSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSW 600 

Qy 601 EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 660 

I I M I M I I : I II I II II I II I I II I I II I I I I I I I I I I I II II II I : II II I I I I II 
Db 601 EDVLHLGEESPSHLYYCQLEAGACYVFTEQLGRFALVGEALSVAATKRLRLLLFAPVACT 660 



Qy 

Db 



661 
661 



SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 
I I M I M II I I I II II I II I II II I I I M I II I I I I II II I I II II I II I II I II II I II 
SLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLW 



720 
720 



Qy 721 KSKLLVS YQEI PFYHIWNGTQRYLHCTFTLERVS PSTSDLACKLWVWQVEGDGQS FSINF 780 

Ml Mill lllllllllll 11:11 Mill III:: I I II I I I I : I II I II II I II I : II I 
Db 721 KSKLLVSYQEIPFYHIWNGTQQYLHCTFTLERINASTSDLACKVWVWQVEGDGQSFNINF 780 

Qy 781 NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 840 

I I I I M I I I I I I I I M I I I I M I II M I I I I I I I II I M : I II I II llllllllllll 
Db 781 NITKDTRFAELLALESEGGVPALVGPSAFKIPFLIRQKIIASLDPPCSRGADWRTLAQKL 840 

Qy 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

I I I I I I I I M I I I I I I I I I I I I I I II I I I I I I I II I I I I I I II I II I I I II I II II I 
Db 841 HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLGQLAAAVAGLGQPDAGLFTVSEAEC 898 



RESULT 3 
UN5A_HUMAN 

ID UN5A_HUMAN STANDARD; PRT; 842 AA. 

AC Q6ZN44; Q8TF26; Q96GP4; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5A precursor (Unc-5 homolog A) (Unc-5 homolog 1) . 

GN Name=UNC5A; Synonyms=KIAA1976, UNC5H1; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBIJTaxI D= 9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC TISSUE=Brain; 

RX PubMed=14702039; DOI=10 . 1038/ngl285 ; 

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R. , 

RA Wakamatsu A., Hayashi K., Sato H., Nagai K. , Kimura K., Makita H . , 

RA Sekine M. , Obayashi M. , Nishi T . , Shibahara T., Tanaka T., Ishii S., 

RA Yamamoto J. -I., Saito K., Kawai Y., Isono Y., Nakamura Y., 

RA Nagahari K. , Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M. , 

RA Shiratori A., Sudo H. f Hosoiri T., Kaku Y., Kodaira H., Kondo H., 

RA Sugawara M. , Takahashi M. , Kanda K. , Yokoi T., Furuya T., Kikkawa E., 

RA Omura Y. , Abe K., Kamihara K., Katsuta N., Sato K., Tanikawa M. , 

RA Yamazaki M. , Ninomiya K. , Ishibashi T., Yamashita H., Murakawa K. , 

RA Fujimori K. , Tanai H., Kimata M. , Watanabe M. , Hiraoka S., Chiba Y. , 

RA Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M. , Hotuta T., 

RA Kusano J . , Kanehori K. , Takahashi-Fujii A., Hara H., Tanase T.-O., 

RA Nomura Y., Togiya S., Komai F., Hara R. , Takeuchi K. , Arita M. , 

RA Imose N., Musashino K. , Yuuki H., Oshima A., Sasaki N., Aotsuka S. f 

RA Yoshikawa Y. , Matsunawa H., Ichihara T. f Shiohata N., Sano S., 

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y. , Suzuki O., 

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H. , 

RA Hishigaki H., Watanabe T., Sugiyama A., Takemoto M. , Kawakami B. , 

RA Yamazaki M. , Watanabe K., Kumagai A., Itakura S., Fukuzumi Y. f 

RA Fujimori Y., Komiyama M. , Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T. , Yamada K. , Fujii Y., Ozaki K. , Hirao M. , Ohmori Y., 

RA Kawabata A. , Hikiji T., Kobatake N . , Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R. , Kawakami T., Noguchi S., Itoh T. f Shigeta K w Senba T. , 

RA Matsumura K., Nakajima Y., Mizuno T., Morinaga M. , Sasaki M., 

RA Togashi T w Oyama M. , Hata H., Watanabe M. , Komatsu T., 

RA Mizushima-Sugano J . , Satoh T., Shirai Y., Takahashi Y. , Nakagawa K. , 

RA Okumura K., Nagase T., Nomura N., Kikuchi H., Masuho Y., Yamashita R. , 



RA Nakai K. , Yada T . , Nakamura Y., Ohara O., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 3) , AND SEQUENCE OF 302-842 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L. H . , Derge J.G., 

RA Klausner R. D. , Collins F.S., Wagner L., Shenmen CM. , Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J. , Hsieh F., 

RA Diatchenko L. , Marusina K. , Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M. F. , Casavant T.L., Scheetz T . E. , 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P. J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M. , Gay L.J., Hulyk S.W., 

RA. Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J . , Helton E., Ketteman M., Madan A. , Rodrigues S., Sanchez A. , 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y. , Bouffard G. G. , 

RA Blakesley R.W. , Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A. , Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP SEQUENCE OF 624-728 FROM N.A. 

RC TISSUE=Brain; 

RX MEDLINE=21842142; PubMed=11853319; 

RA Nagase T. , Kikuno R. , Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. XXII. 

RT The complete sequences of 50 new cDNA clones which code for large 

RT proteins."; 

RL DNA Res. 8:319-327(2001). 

RN [4] 

RP INDUCTION. 

RX PubMed=12598906; DOI=10 . 1038/ncb943; 

RA Tanikawa C, Matsuda K., Fukuda S., Nakamura Y. , Arakawa H. ; 

RT "p53RDLl regulates of p53-dependent apoptosis."; 

RL Nat. Cell Biol. 5:216-223(2003). 

RN [5] 

RP DOWN-REGULATION IN CANCER. 

RX PubMed=12655055; DOI=10 . 1073/pnas . 0738063100 ; 

RA Thiebault K. , Mazelin L., Pays L., Llambi F. f Joly M.-O., 

RA Scoazec J.-Y., Saurin J.-C, Romeo G. f Mehlen P.; 

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 



CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with MAGED1. Interacts with PRKCABP, possibly mediating some 

CC interaction with PKC (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. The interaction 
CC with PRKCABP regulates its surface expression and leads to its 

CC removal from surface of neurons and growth cones (By similarity) . 

CC -!- ALTERNATIVE PRODUCTS: 

CC . Event=Alternative splicing; Named isoforms=3; 

CC Name=l; 

CC IsoId=Q6ZN44-l; Sequence=Displayed; 

CC Note=No experimental confirmation available; 

CC Name=2 ; 

CC IsoId=Q6ZN44-2; Sequence=VSP_011694 , VSP_011695; 

CC Note=No experimental confirmation available; 

CC Name=3; 

CC IsoId=Q6ZN44-3; Sequence=VSP_011693 ; 

CC Note=No experimental confirmation available; 

CC -!- INDUCTION: By p53/TP53. 

CC -!- DOMAIN: The ZU5 domain mediates the interaction with MAGED1, which 

CC participates in the induction of apoptosis (By similarity) . 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues. 

CC Phosphorylated by PKC in vitro (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC MISCELLANEOUS: Down- regulated in multiple cancers including 

CC colorectal, breast , ovary, uterus, stomach, lung, or kidney 

CC cancers . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 
CC SIMILARITY: Contains 1 TSP type-1 domain. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC -!- CAUTION: Ref.3 sequence differs from that shown due to the 
CC presence of introns . 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib. ch) . 



CC --: 

DR EMBL; AK131380; BAD18531.1; -. 

DR EMBL; BC009333; AAH09333.2; -. 

DR EMBL; BC033727; -; NOT_ANNOTATED_CDS . 

DR EMBL; AB075856; BAB85562.1; ALT_SEQ. 

DR Genew; HGNC: 12567; UNC5A. 

DR MIM; 607869; 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR000884; TSP1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP 1; 1. 
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SMART; SM00409; IG; 1. 
SMART; SM00209; TSP1; 1. 

PROSITE; PS50017; DEATHJDOMAIN; FALSE_NEG. 

PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 1. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Signal; 
Transmembrane . 

SIGNAL 1 25 Potential. 

CHAIN 26 842 Netrin receptor UNC5A. 

DOMAIN 1 306 Extracellular (Potential) . 

TRANSMEM 307 327 Potential. 

DOMAIN 328 842 Cytoplasmic (Potential). 

DOMAIN 44 141 Ig-like. 

DOMAIN 155 234 Ig-like C2-type. 

DOMAIN 242 294 TSP type-1. 

DOMAIN 439 542 ZU5 . 

DOMAIN 761 841 Death. 

SITE 340 341 Cleavage (by caspase-3) (By similarity) 

SITE 605 623 Interaction with DCC (By similarity) . 

DISULFID 65 124 By similarity. 

DISULFID 170 221 By similarity. 

CARBOHYD 107 107 N-linked (GlcNAc. . .) 

CARBOHYD 218 218 N-linked (GlcNAc. . .) 

CARBOHYD 287 287 N-linked (GlcNAc. . .) 
VARSPLIC 1 97 



(Potential) . 
(Potential) . 
(Potential) . 



MAVRPGLWPALLGIVLAAWLRGSGAQQSATVANPVPGANPD 
LL PH FLVE P EDVY I VKNK PVL LVC KAVPATQ I FFKCNGEWV 
RQVDHVTERSTDGSN -> MAGTSERSLISSISQPKAIECF 
EVKKKAFLTHGRYHGS GAT P P KT KDPKP ET FCGQT ( in 
isoform 3) . 
/FTId=VSP_011693. 
VARSPLIC 296 301 TASGPE -> SESSLP (in isoform 2). 

/FTId=VSP_011694. 
VARSPLIC 302 842 Missing (in isoform 2) . 

/FTId=VSP__011695. 
SEQUENCE 842 AA; 92958 MW; 3DFADCF973131849 CRC64; 



Query Match 92.0%; Score 4410; DB 1; Length 842; 

Best Local Similarity 93.4%; Pred. No. 0; 

Matches 839; Conservative 2; Mismatches 1; Indels 56; Gaps 



1; 



QY 
Db 



1 MAVRPGLWPALLGIVIWVWLRGSGAQQSATVANPVPGANPDLLPHFLV^PEDWIVICNKP 60 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 I I I I I I I 1 I I 
1 MAWPGLWPALLGIVXAAWLRGSGAQQSAWANPVPGANPDLLPHFLV^^ 60 



Qy 

Db 

Qy 

Db 



61 VXLVCKAVPATQIFFKCNGEWVl^QVDHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLE 120 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I : I I I I I I I I I I I I I I I I I I I I I I I 
61 VXLVCKAVPATQIFFKCNGEWV^QV^HVIERSTDGSNGLPTMEVRINVSRQQVEKVFGLE 120 



121 



180 



EYWCQCVAWS SSGTTKSQKAYI RIARLRKNFEQEPLAKEVSLEQGI VLPCRP PEGI PPAE 
I I I I I I II II I I I I II I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
121 EYWCQCVAWSSSGTTKSQKAYIRIAYLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAE 180 



Qy 

Db 



181 V^WLRNEDLVTDPSLDPNWITREHSLWRQARLADTANYTCVAKNIVARRRSASAAVIW 240 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I II I I I I I I 
181 V^WLRNEDLVT)PSLDPNVTITREHSLVVRQARLADTANYTCVAKN^ 240 



Ov 


241 


1 1 1 l 


oUO 


Db 


241 


Mil 


244 




Ov 
vy 


301 


W^PW^KWS APnT.T1PTHWP^PPP^nPZi.PDMfzr:Trppor ,r PriT riTDMrTcnT nmoncr-DPnuA 
►j.tvv^vjxj.uv^ i ri¥vr\or\iL\^o JJx/\ir rvLN oIjLjiliv^^Ij 1 ULiU 1 KIN 1 oUxjU VnoAobrlliUVA 


ooU 






1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 t 1 I I I I J I . I I \ I I i I I 
1 I I I 1 I I I 11 1 II 1 II 1 It it 1 1 1 1 1 1 1 II 1 1 1 II 1 1! 1 1 1 M 1 1 1 1 II II 2 1 1 1 1 II II 




Db 


245 


WSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHTASGPEDVA 


304 


Ov 


361 


T.YVm\T AVAVfT.VTJJ.T \/T TT WfDVVPrT nQn^/anccTT TcrpnDucTyocvAnMDUT t 
xjx vuijj.nvnv^jjViiijjjjjvLiJ.liV I UKX\I\Ei<j±j.UoJJ VAJJoolLl obtyr Vox J\lro aAJJN irnijij 


A O A 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ) 1 1 1 1 t 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 I 1 I 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 I II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I I I I I I I | | I 




Db 


305 


L YVGLI AVAVCLVLLLLVL I LVYCRKKEGLDS DVADS S I LTS GFQ PVS I KP S KADN PHLL 


364 


wy 


1 c. X 


i lyriJLol 1111 xyijoljV^f KylJCjrbr J\r yLlNGnJxLSrLGGGRnTLHHSSPTSEAEEFVS 


480 






1 1 I 1 1 1 1 I 1 I 1 1 1 I 1 1 1 1 1 1 I 1 ! 1 t i 1 1 1 1 1 1 1 1 1 1 t 1 ! 1 1 1 1 1 1 1 1 1 1 1 f 1 I I I I I I I I 

1 1 1 1 1 1 1 M 1 1 1 1 II 1 1 1 1 1 1 M 1 II 1 1 1 1 1 II 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


365 


TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSSPTSEAEEFVS 


424 


Ov 

vy 


481 

*I U X 


rvxjo x x r x\oxjiti\oX OlMlYll HjI £ IN r xiOoKljiYLX rrri 1 bl dxjjjj. r xzuJtKL 1tK\jI\X x Ljx x Lil xjiiK 


C A A 

540 






I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 I I I I t I I I I I I I I I t t I I i i i i i i i i i i i t i i i i i i i i i 

M 1 1 1 II M 1 1 1 1 1 1 1 1 I 1 1 M 1 II 1 1 1 1 1 1 II 1 1 I t 1 1 1 M 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 




Db 


425 


RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 


484 


Ov 


1 


r iLij v Kxj ir J^i^uy 1 IjIj b r X v o L-LjF r 


600 






1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 t 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 f 

1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 I 1 1 1 1 1 1 1 1 1 II II 1 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 




Db 


485 


PEDWLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSW 


544 


Ov 

vy 


GUI 


EjUvjunij^Ejiii/\irorlljx luyigtvAjbAU x V r 1 Ciyii(jKr ALVGriAJjSVAAAKRLKLLLFAP VACT 


660 






1 1 1 1 1 1 1 1 \ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 \ 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1 1 1 1 1 1 1 1 I 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


545 


EDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACT 


604 


wy 


fifil 


•jIjJ-'Iimxkv xL,ijniJin]Ji\ijJYE*v vyiji^Jxyijiaij^jjiyitjrKVljnr Jvub xiiNLRLSIHDVPSSLW 


*7 O A 

720 






i 1 t I 1 1 1 1 I I i I I I 1 I t I ( I I i i i i i i i i i i i i t i i i i i > i i i i i i i i i i i i i i i i i i i i 
II II 1 1 II 1 1 t 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | | | 




Db 


605 


S LE YNI RVYCLHDTHDALKEWQLEKQLGGQLI QEPRVLHFKDS YHNLRLS IHDVP S SLW 


664 


Ov 

wy 


771 

/ X 


ivoj\xjXjVo xycjXirr xrtxWlNijl yKI LiJtiUl r 1 xjll*KVoiro 1 oULACKJjW VWyVhjGDGQSF S1NF 


*7 O A 

780 






1 1 1 E 1 1 1 I 1 1 1 1 1 1 1 i 1 1 E 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 t 1 I I I t I i i i i i i i i i i i i i i 
N 1 1 1 1 M 1 II 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 | | | | | 1 | | | | || | 




Db 


665 


KSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINF 


724 


Ov 

wy 


7ft1 
/ox 


NITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKL 


840 






M 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


725 


NIT KDT RFAELLALE S EAGVP ALVGP SAFKI P FL I RQK 1 1 S S LD P P C RRGADWRT LAQ KL 


784 


Qy 


841 


HLDSHLSFFASKPSPTAMILNLWEARilFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 






1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 




Db 


785 


HLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQIxAAAVAGLGQPDAGLFTYSEAEC 842 



RESULTS 
UN5C_M0USE 

ID UN5C_M0USE STANDARD; PRT; 931 AA. 

AC 008747; Q8CD16; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) 
DE (Rostral cerebellar malformation protein) . 
GN Name=Unc5c; Synonyms=Rcm, Unc5h3; 
OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 



OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus . 

OX NCBI_TaxI D= 10090; 

RN [1] 

RP SEQUENCE FROM N. A. (ISOFORM 1), FUNCTION, DISEASE, AND TISSUE 

RP SPECIFICITY. 

RC STRAIN=C57B6/SJL; 

RX MEDLINE=97271898; PubMed=9126743; 

RA Ackerman S.L., Kozak L.P., Przyborski S.A., Rund L.A., Boyer B.B., 

RA Knowles B.B.; 

RT "The mouse rostral cerebellar malformation gene encodes an UNC-5-like 

RT protein."; 

RL Nature 386:838-842(1997). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC STRAIN=C57BL/6J; TISSUE=Testis ; 

RX MEDLINE=22354683; PubMed=12466851; DOI=10 . 1038/nature01266; 

RA Okazaki Y., Furuno M. , Kasukawa T., Adachi J., Bono H., Kondo S., 

RA Nikaido I., Osato N., Saito R., Suzuki H., Yamanaka I., Kiyosawa H., 

RA Yagi K., Tomaru Y., Hasegawa Y. , Nogami A., Schonbach C, Gojobori T., 

RA Baldarelli R. , Hill D.P., Bult C, Hume D.A. , Quackenbush J., 

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J. A., Bradt D., Brusic V., Chothia C, Corbani L.E., Cousins S., 

RA Dalla E., Dragani T.A. , Fletcher C.F., Forrest A., Frazer K.S., 

RA Gaasterland T., Gariboldi M. , Gissi C, Godzik A., Gough J., 

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D., 

RA Kanai A. , Kawaji H., Kawasawa Y. , Kedzierski R.M., King B.L., 

RA Konagaya A., Kurochkin I.V., Lee Y., Lenhard B., Lyons P. A., 

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L-, Miki H., 

RA Nagashima T., Numata K. , Okido T., Pavan W.J., Pertea G., Pesole G., 

RA Petrovsky N., Pillai R. , Pontius J.U., Qi D., Ramachandran S., 

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M. f 

RA Sandelin A., Schneider C, Semple C.A., Setou M. , Shimada K., 

RA Sultana R. , Takenaka Y., Taylor M.S., Teasdale R.D., Tomita M. , 

RA Verardo R. , Wagner L., Wahlestedt C, Wang Y. , Watanabe Y., Wells C, 

RA Wilming L.G., Wynshaw- Boris A., Yanagisawa M. , Yang I., Yang L. , 

RA Yuan Z., Zavolan M. , Zhu Y. , Zimmer A., Carninci P., Hayatsu N., 

RA Hirozane-Kishikawa T., Konno H., Nakamura M. , Sakazume N., Sato K., 

RA Shiraki T., Waki K., Kawai J., Aizawa K. , Arakawa T., Fukuda S., 

RA Hara A., Hashizume W., Imotani K. , Ishii Y., Itoh M. , Kagawa I., 

RA Miyazaki A., Sakai K., Sasaki D., Shibata K., Shinagawa A., 

RA Yasunishi A., Yoshino M. , Waterston R. , Lander E.S., Rogers J., 

RA Birney E., Hayashizaki Y. ; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

RN [3] 

RP FUNCTION, AND TISSUE SPECIFICITY. 

RX PubMed=9389662; 

RA Przyborski S.A., Knowles B.B., Ackerman S.L.; 

RT "Embryonic phenotype of Unc5h3 mutant mice suggests chemo repulsion 

RT during the formation of the rostral cerebellar boundary."; 

RL Development 125:41-50(1998). 

RN [4] 

RP INTERACTION WITH DCC. 

RX PubMed=10399920; 

RA Hong K., Hinck L., Nishiyama M. , Poo M.-M., Tessier-Lavigne M. , 

RA Stein E. ; 



RT M A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion,"; 

RL Cell 97:927-941(1999) . 

RN [5] 

RP PHOSPHORYLATION SITE TYR-568, AND MUTAGEN E S I S OF TYR-568. 

RX PubMed=11533026; DOI=10 . 1074/ jbc .M103872200; 

RA Tong J., Killeen M. , Steven R., Binns K.L. , Culotti J. , Pawson T.; 

RT "Netrin stimulates tyrosine phosphorylation of the UNC-5 family of 

RT netrin receptors and induces Shp2 binding to the RCM cytodomain . " ; 

RL J. Biol. Chem. 276:40917-40925(2001). 

RN [6] 

RP FUNCTION. 

RX PubMed=12451134; DOI=22/23/10346; 

RA Finger J.H., Bronson R.T., Harris B., Johnson K. , Przyborski S.A., 

RA Ackerman S.L.; 

RT "The netrin 1 receptors Unc5h3 and Dec are necessary at multiple 

RT choice points for the guidance of corticospinal tract axons."; 

RL J. Neurosci. 22:10346-10356(2002). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 

CC ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=O08747-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=O08747-2; Sequence=VSP_011702 ; 

CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 
CC neurons. Highly expressed in brain and lung. Weakly expressed in 

CC testis, ovary, spleen, thymus and bladder. Expressed at very low 

CC level in kidney, intestine and salivary gland. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 
CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

CC netrin-dependent . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- DISEASE: Defects in Unc5c are the cause of rostral cerebellar 
CC malformation (Rem) . Rem is characterized by cerebellar and 

CC midbrain defects, apparently as a result of abnormal neuronal 

CC migration. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 



CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
KW 



GO; 
GO; 
GO; 
GO; 



EMBL; U72634; AAB54103.1; 
EMBL; AK031655; BAC27495.1; 
HSSP; P07996; 1LSL. 
MGD; MGI: 1095412; Unc5c. 
GO; GO: 0005886; C:plasma membrane; IC. 

F:netrin receptor activity; IDA. 
F: protein binding; IDA. 
P: brain development; IMP. 
P: regulation of cell migration; IMP. 
Death. 
DEATH_like . 
Ig-like . 
Ig_c2 . 
TSP1. 
TSP_1. 
ZU5. 

Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TS PI REPEAT . 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 

PROSITE; PS50017; DEATH__DOMAIN ; FALSE_NEG. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 
Transmembrane . 



GO 
GO 
GO 
GO 

InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 



0005042; 
0005515; 
0007420; 
0030334; 

IPR000488; 
IPR011029; 
IPR007110; 
IPR003598; 
IPR000884; 
IPR008085; 
IPR000906; 



FT 


SIGNAL 


1 


40 


Potential . 


FT 


CHAIN 


41 


931 


Netrin receptor UNC5C. 


FT 


DOMAIN 


41 


380 


Extracellular (Potential) . 


FT 


TRANSMEM 


381 


401 


Potential . 


FT 


DOMAIN 


402 


931 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


62 


159 


Ig-like. 


FT 


DOMAIN 


161 


256 


Ig-like C2-type. 


FT 


DOMAIN 


260 


314 


TSP type-1 1. 


FT 


DOMAIN 


316 


368 


TSP type-1 2. 


FT 


DOMAIN 


528 


631 


ZU5. 


FT 


DOMAIN 


850 


929 


Death. 


FT 


SITE . 


415 


416 


Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


694 


712 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


83 


142 


By similarity. 


FT 


DISULFID 


188 


239 


By similarity. 


FT 


MOD RES 


568 


568 


Phosphotyrosine . 



FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
FT 
SQ 



CARBOHYD 
CARBOHYD 
VARSPLIC 

MUTAGEN 



CONFLICT 
CONFLICT 
CONFLICT 
SEQUENCE 



236 
361 
370 

568 



16 
733 
924 
931 AA; 



236 
361 
370 

568 



16 
733 
924 

103062 MW; 



N-linked (GlcNAc. . .) (Potential). 

N-linked (GlcNAc. . . ) (Potential). 

A -> GFIYPISTEHRPQNEYGFSS (in isoform 2) 

/FTId=VSP_011702. 

Y->F: Abolishes interaction with PTPNll f 
leading to a increased level of 
phosphorylation . 
L -> I (in Ref . 2) . 
H -> R (in Ref. 2) . 
S -> Y (in Ref. 2) . 



8A5D951A4EECA179 CRC64; 



Query Match 58.2%; Score 2787; DB 1; Length 931; 

Best Local Similarity 57.3%; Pred. No. 8.7e-201; 

Matches 522; Conservative 153; Mismatches 208; Indels 28; 



Gaps 



9; 



Qy 


9 


Db 


26 


Qy 


66 


Db 


84 


Qy 


126 


Db 


144 


Qy 


186 


Db 


204 


Qy 


246 


Db 


264 


Qy 


306 


Db 


324 


Qy 


366 


Db 


384 


Qy 


425 


Db 


441 


Qy 


481 


Db 


500 


Qy 


527 


Db 


560 



PALLGI VLAAWLRGS GAQQ S A TVANPVPGANPDLLPHFLVEPEDVyiVKNKPVLLVC 

III : I : I I I I I : I I : I I I I I : I I I : I I I I I II I I I 

PAL — ALLSASGTGSAAQDDEFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 



65 



83 



KAVPATQIFFKCNGEWWQVl^HVIERSTDGSSGLPTMEV^INVSRQQVEKVFGLEEYWCQ 125 

II I I I I I : I I I I III I III:: I : I I I II I : I I I I I I : : I I I : I I I I 
KASPATQIYFKCNSEWVHQKDHVVDERVI)ETSGLIVREVSIEISRQQVEELFGPEDYWCQ 143 

CVAWS S SGTTKSQKAYI RIARLRKNFEQEPLAKEVS LEQGI VLPCRP PEGI P PAEVEWLR 185 
I I I I I I : I I I I I : I I I : I I I III I I I I I I I I I I I I I : : I I I II I I I I II ! I I I : 
CVAWS SAGTTKSRKAYVRIAYLRKTFEQEPLGKEVSLEQEVLLQCRP PEGI PVAEVEWLK 203 

NEDLVTDPSLDPNVYITREHSLVVRQARLADTANYTCV^ 245 

I I I : : II : I I III : I : I : : : I I I I : I I I I I I I I I I I I I I : I : I : I I I I I I I I I I 
NEDIIDPAEDRNFYITIDHNLIIKQARLSDTANYTCVAKNIVAKRKSTTATVIVYVNGGW 263 

STWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 
I I I I I I I I I :: I I II : I I I : I : I I I I I I I I I I I I I I I I : I I I II I I I I I I I I : II 
STWTEWSVCNSRCGRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTSWS 323 

KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

III II : I I I I I III: I I I : I I I : : I I I :: I I I I I : : I : I I I I I I I : 
KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVXQSKNCTDGLCMQAAPDSDDVALYVGI 383 



I I I I I I : : : I : I I : : I I : I I I I I I I I I : I I : : I I I : I 
VIAWVCLJUTVWALFVYRKNHRDFESDIIDSSALNGGFQPVNIKAARQD LLAVPP 

DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVS 

II:: I : I : I II : I I :| II : : : : I I I : II I 
DLTSAAAMYRGPVYALHD-VSDKIPMTNSPILDPLPNLKIKVYNSSGAVTPQDDLAEFSS 



424 



440 



480 



499 



526 



RLS TQNYF RSLPRGT — SNMTYGTFNFLGGRLMIPNTGISLLIPPDAI 

: I I II: : I I I I I : I I I I I I I I : I I I : I : I I I I I II 

KLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAI 559 

PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVXLTRPVI LAMDHCGEPSPDS 586 
I : I : : I I : I : I : I : I :: I I : I I I I : I : I I I I I I I I I I I I I I I : II : I I : 
PQGRVYEMYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVI LTLHHCADPSTED 619 



Qy 



587 WSLRLKKQSCEGSWEDVXHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 



Db 



I : : II I : : I II I I : : I I I : I I I : I I I : : I I I : I I I I : : : I | | 
620 WKIQLKNQAVQGQWEDWWGEENFTTPCYIQLDAEACHILTENLSTYALVGQSTTKAAA 679 



Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 706 

I I I I I : I I : I : I I II : I I I I I I II I I I I I I : I I I : I : I II I : : I I : I I I I I I 

Db 680 KRLKLAI FGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGSIH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 

I I I I I I I I : I I II I I I I I II I I I I I I I : I : I I I I I I I I I I I : I : I : I III I 

Db 740 NLRLSIHDIAHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLCV 799 

Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

I I I I : I I I : I : :: : : I I : : : : I I I I I II I I I I : I I I I I 

Db 800 RQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSLDAP- 859 

Qy 827 CRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQP 886 

II III II I I : I I : I : : I I : I III : I I : II I I : : I I : I I I I III : : I : 

Db 860 QTRGHDWRMLAHKLNLDRYLN YFATKS S PTGVI LDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSEAE 897 

• ! • I • 

Db 920 ETWSLAAEGQ 930 

RESULT 5 
UN5C_RAT 

ID UN5C_RAT STANDARD; PRT; 931 AA. 

AC Q761X5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) . 

GN Name=Unc5c; Synonyms=Unc5h3 ; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxI D=l 0 1 1 6 ; 

RN [1] 

RP SEQUENCE FROM N.A., AND DISEASE. 

RX PubMed=15010202 ; DOI=10 . 1016/ j .molbrainres . 2003 . 12 . 003 ; 

RA Kuramoto T., Kuwamura M. , Serikawa T.; 

RT "Rat neurological mutations cerebellar vermis defect and hobble are 

RT caused by mutations in the netrin- 1 receptor gene Unc5h3."; 

RL Brain Res. Mol. Brain Res. 122:103-108(2004). 

RN [2] 

RP FUNCTION. 

RX PubMed=11387206; DOI=10 . 1093/embo j /20 . 11 . 2715 ; 

RA Llambi F. , Causeret F., Bloch-Gallego E., Mehlen P.; 

RT "Netrin- 1 acts as a survival factor via its receptors UNC5H and DCC"; 

RL EMBO J. 20:2715-2722(2001). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding.. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 



CC ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Mainly expressed in brain. Also expressed in 

CC kidney. Not expressed in developing or adult lung. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 

CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

CC netrin-dependent (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis . 

CC -!- DISEASE: Defects in Unc5c are the cause of cerebellar vermis 

CC defect (cvd) and hobble (hob) phenotypes. Cvd and hob rats exhibit 

CC cerebellar and midbrain defects, possibly as a result of abnormal 

CC neuronal migration, and exhibit laminar structure abnormalities in 

CC the fused cerebellar hemispheres and ectopic cerebellar tissues in 

CC the cerebello-pontine junction. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AB118026; BAD05181.1; -. 

DR RGD; 735109; UncSc. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR003599; Ig. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TS PI REPEAT . 



DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00409; IG; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH DOMAIN; FALSE NEG. 



Ur\ 


PROSITE; 


PS50835; 


IG LIKE; 


1. 


HD 


PROSITE; 


PS50092; 


TSP1; 2. 




J\W 


Apoptosis 


; Developmental protein; Immunoglobulin domain; 




Phosphorylation; 


Receptor; 


Repeat; Signal; Transmembrane. 


FT 

r i 


SIGN Ail 


1 


A f\ 

40 


Potential . 


FT 


CHAIN 


A T 

41 


yii 


Netrin receptor UNC5C. 


C 1 


DOMAIN 


41 


380 


Extracellular (Potential) . 


FT 




Jul 


401 


Potential . 


L 1 ' 1 1 

r i 


JJUMA1N 


a no 


1 


Cytoplasmic (Potential) . 


FT 


FlPkMTV TM 

DUrU\±vi 


t>Z 


ioy 


Ig-like . 


t T 


T>(~\N(7\ TM 

UUMAJ.N 


lol 


zoo 


Ig-like C2-type. 


FT 


DOMAIN 


z DU 




TSP type-1 1. 


ft 


T\/"\"fcir7\ TXT 

DOMAIN 


31o 


"3 C Q 


TSP type-1 2. 


FT 

£ J. 


DOMAIN 


528 


631 


ZU5. 


"FT 

J. X 


UOMAJ.N 


OOU 


929 


Death. 


FT 


SITE 


415 


416 


Cleavage (by caspase-3) (By similarity) . 


r l 


SITE 


694 


712 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


83 


142 


By similarity. 


T«T 


DISULFID 


188 


239 


By similarity. 


FT 


MOD RES 


568 


568 


Phosphotyrosine (By similarity) . 


FT 


CARBOHYD 


236 


236 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


361 


361 


N-linked (GlcNAc. . . ), (Potential). 


SQ 


SEQUENCE 


931 AA 


; 103134 


MW; 25B183A97BCB8401 CRC64; 



Query Match 57.8%; Score 2770; DB 1; Length 931; 

Best Local Similarity 57.0%; Pred. No. 1.7e-199; 

Matches 519; Conservative 154; Mismatches 210; Indels 28; Gaps 9; 

Qy 9 PALLGIVLAAWLRGSGAQQS ATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVC 65 

M 1 : I : I I I I I : I I : I I I I I : I I I : I I I I I I I I I I 

Db 26 PAL — ALLSASGTGSAAQDDDFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 83 

Qy 66 KAVPATQIFFKCNGEWVT^QVDHVIERSTDGSSG 125 

II I I II I : I I I I III I III:: I : I I I II I : I I I I I I : : I I I : I M | 
Db 84 KASPATQIYFKCNSEWVHQKDHVVI)ERvT)ETSGLIVREVSIEISRQQVEELFGPED™CQ 143 

Qy 126 CVAWSSSGTTKSQKAYIRIARLRKNFEQEPl^EVSLEQGIVXPCRPPEGIPPAEVEWLR 185 

I M I I I : II I I I : I II : I I I III I I I I II I I I I I I I : : I I I I I I I I I I I I I I I : 
Db 144 CVAWSSAGTTKSRKAYVRIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPMAEVEWLK 203 

Qy 186 NEDLVTlPSLDPNWITREHSLWRQARlxADT^ 245 

I I I : : I I I I III : I : I : : : I I I I : I I I I I I I I I I I I I I : I : I : I I I I I I I I I I 
Db 204 NEDIIDPVEDRNFYITIDHNLIIKQARLSDTANYTCVAKNIVJ\KRKSTTATVIVYVNGGW 263 

Qy 246 STWTEWSVCSASCGRGWQKRSRSCTNPAPIxNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 

I M I I I I I : : III I : I I I : I : I I I I I I I I I I I I I I I I : I I I II I I I I I I I I : II 
Db 264 S TWAEWS VCNS RCGRG YQKRT RTCTN PAP LNGGAFCEGQ S VQKI ACTT LC P VDGRWT S WS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

I I I I I : I I I I I I I I : I I I : I I I :: I | I : : I I I I I : : I : I I I I I I I : 
Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVXQSKNCTDGLCMQAAPDSDDVALYVGI 383 

Qy 366 - 1 AVAVCLVLLLLVL I LVYCRKKEGLDS DVAD S S I LT S GFQ P VS I KP S KADN PHLLT I QP 424 

I I I I I I : : : I : I I : : I : : I I I I I I I I I : I I : : I I I : I 
Db 384 VI AVT VC LAI TWVAL FVYRKNHRD FE S N 1 1 D S S ALN GG FQ P VN I KAARQ D LLAVPP 440 

Qy 425 DLSTTTTT YQGSLCPRQDGPS PKFQLTNGHLLS PLGGGRHTLHHS S PTSEAEEFVS 480 



II:: 1:1 : I I I :M :| II : :::|| I : II I 
Db 441 DLTSAAAMYRGPVYALHD-VSDKIPMTNSPILDPLPNLKIKVYNSSGAVTPQDDLAEFSS 499 

Qy 481 RLS TQNYF RSLPRGT-- -SNMTYGTFNFLGGRLMIPNTGISLLIPPDAI 526 

: I I II: : I I I I I : I I I I I I I I : I I I : I : I I I I I II 

Db 500 KLSPQMTQSLLENEALNLKNQSLARQTDPSCTAFGTFNSLGGHLIIPNSGVSLLIPAGAI 559 

Qy 527 PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDS 586 

I : I : : I I : I : I : I : I : : I I : I I I I : I : I I I I I I I I I I I I I I I : II : I : : 
Db 560 PQGRWEMYVTVHRKENMRPPMEDSQTLLTPWSCGPPGALLTRPVI LTLHHCADPNTED 619 

Qy 587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 

I : : I I I : : I I I I I : : I I I : I I I : I I I : : I I I : I I I I : : : I I I 

Db 620 WKIQLKNQAVQGQWEDWWGEENFTTPCYIQLDAEACHILTENLSTYALVGQSTTKAAA 679 

Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 706 

II I I I : I I : I : I I I I : I I I I I I II I I I I I I : I I I : I : I I I I : : I I : I I I I I I 

Db 680 KRLKLAI FGPLCCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKALHFKGSIH 739 

Qy 707 NLRLS I HDVPS S LWKS KLLVS YQEI PFYHIWNGTQRYLHCT FTLERVS PSTS DLACKLWV 766 

I I I I I I I I : I I I I I I I I I I I I I I I I I I : I : I I I I I I I I II I : I : I : I III I 
Db 740 NLRLSIHDITHSLWKSKLLAKYQEIPFYHIWSGSQRNLHCTFTLERLSLNTVELVCKLCV 799 

Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

II I I : I I I : I : : : : : I I : : : : I I I I I I I I I I I : I I I I I 
Db 800 RQVEGEGQIFQLNCTVSEEPTGIDLPLLDPASTITTVTGPSAFSIPLPIRQKLCSSLDAP 859 

Qy 827 CRRGADWRTLAQKLHLDSHLS FFASKPS PTAMI LNLWEARHFPNGNLSQLAAAVAGLGQP 886 

II III II I I: LI :|::||:| III : I I : I I I I : : I I : I I I I III : :|: 
Db 860 QTRGHDWRMIJVHKLNLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSEAE 897 

: : :| : 
Db 920 ETWS LAAEGQ 930 



RESULT 6 
UN5C_CHICK 

ID UN5C_CHICK STANDARD; PRT; 931 AA. 

AC Q7T2Z5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) 

DE ( CUNC-5H3) . 

GN Name=UNC5C; 

OS Gallus gallus (Chicken) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Archosauria; Aves; Neognathae; Gallif ormes ; Phasianidae; Phasianinae; 

OC Gallus . 

OX NCBI_TaxID=9031; 

RN [1] 

RP SEQUENCE FROM N.A. , AND TISSUE SPECIFICITY. 

RX PubMed=127 99087; 

RA Guan W., Condic M.L.; 

RT "Characterization of Netrin-1, Neogenin and CUNC-5H3 expression during 

RT chick dorsal root ganglia development."; 



RL Gene Expr. Patterns 3:369-373(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Restricted to proprioceptive neurons. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AY187310; AA067275.1; -. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 



DR 


PROSITE; 


PS50017; 


DEATH 


DOMAIN; FALSE NEG. 


DR 


PROSITE; 


PS50835; 


IG LIKE; 


1. 


DR 


PROSITE; 


PS50092; 


TSP1; 


2. 




KW 


Developmental protein; 


Immunoglobulin domain; Phosphorylation; 


KW 


Receptor; 


Repeat; 


Signal; 


Transmembrane . 


FT 


SIGNAL 


1 


39 




Potential. 


FT 


CHAIN 


40 


931 




Netrin receptor UNC5C. 


FT 


DOMAIN 


40 


380 




Extracellular (Potential) . 


FT 


TRANS MEM 


381 


401 




Potential. 


FT 


DOMAIN 


402 


931 




Cytoplasmic (Potential) . 


FT 


DOMAIN 


62 


159 




Ig-like . 


FT 


DOMAIN 


161 


256 




Ig-like C2-type. 


FT 


DOMAIN 


260 


314 




TSP type-1 1. 


FT 


DOMAIN 


316 


368 




TSP type-1 2. 


FT 


DOMAIN 


528 


631 




ZU5. 


FT 


DOMAIN 


850 


929 




Death. 


FT 


DISULFID 


83 


142 




By similarity. 


FT 


DISULFID 


188 


239 




By similarity. 


FT 


CARBOHYD 


236 


236 




N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


361 


361 




N-linked (GlcNAc. . .) (Potential) 



SQ SEQUENCE 931 AA; 102906 MW; 1E23A0D84F2E2C62 CRC64; 



Query Match 57.6%; Score 2761; DB 1; Length 931; 

Best Local Similarity 57.0%; Pred. No. 7.9e-199; 

Matches 518; Conservative 151; Mismatches 212; Indels 28; Gaps 9; 



Qy 9 PALLGIVLAAWLRGSGAQQS ATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVC 65 

Ml Ml I I I I : I I : I I I I I : I I I : I I I I I I I I I I 

Db 26 PAL — AVLGASRPGSAAQDDDFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 83 

Qy 66 KAVPATQI FFKCNGEWVRQVDHVT ERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYWCQ 125 

II 11111:1111 III I III:: I :||| II I :||||||::|| |:|||| 

Db 84 KAS PATQ I YFKCN S EWVHQKDHWDERVDET S GLI VCEVS I EI S RQQVEELFGPED YWCQ 143 

Qy 126 CVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLR 185 

I I I I I I : I I I I I : I I I : I I I III Mil l MM III ::| I I I i I I i I lllllh 
Db 144 CVAWSSAGTTKSRKAYWIAYLRKTFEQEPLGKEVSLEQEVXLQCRPPEGIPVAEVEWLK 203 

Qy 186 NEDLVD P S LD PNVYI TREH S LWRQARLADTAN YT CVAKN I VARRRS AS AAVT VYVN GGW 245 

ll:::M I I Ml : I : I : : : M I I : M M M M I M I M : I : I :| MUM III 
Db 204 NEEVI DP VEDRN FYI T I DHNL 1 1 KQARLS DTANYT CVAKN I VAKRKSTTATVT VYVNGGW 263 

Qy 246 STWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 

I I I I I I 1 I- I I II : I I I : I : I I I I I I I I I I I I I I I II I I I II I I I i 1 1 I |: II 
Db 264 STWTEWSACNSRCGRGFQKRTRTCTNPAPLNGGAFCEGQNVQKIACTTLCPVDGKWTSWS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

III II MIMI III: M|:IM::|:| I : : I I I ||: :| : M M I I I : 

Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCEGLVLQSKNCTDGLCMQAAPDSDDVALYVGI 383 

Qy 366 -IAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQP 424 

I M I I I : : : I : II : : I I : I I I I I I M I : I I : : I I I : I 
Db 384 VI AVI VC LAI SVWALFVYRKNHRDFES DI I DS SALNGGFQPVNI KAARQD LLAVPP 440 

Qy 425 DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHSS PTSEAEEFVS 480 

II:: I : I : I II Ml MM : :: :: I I I Ml 
Db 441 DLTSAAAMYRGPWALHD-VSDKIPMTNSPILDPLPNLKIKVYNTSGAVTPQDELSDFSS 499 

Qy 481 RLS TQNYF RSLPRGT — SNMT YGTFNFLGGRLMI PNTGI SLLI PPDAI 526 

Ml II: Ml I I I Mill III I : I I I : I : I I I I I |: 

Db 500 KLSPQITQSLLENETLNVl^QSLARQTDPSCTAFGTFNSLGGHLVTPNSGVSLLIPAGAV 559 

Qy 527 PRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDS 586 

I • I : : II : I : I : I : I M |: I I I I : I M I I I I I I MINIM I II II: I 
Db 560 PQGRVYEMYVTvIiRKEGMRPPVTSDSQTLLTPW 619 

Qy 587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 

I : M I I : M I II I : M I I : III: I I : : I I I : I I I I : : : : II I 
Db 620 WQIQLKHQAGQGPWEDWWGEENFTTPCYIQLDPEACHILTETLSTYALVGQSITKAAA 679 

Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYH 706 

I I I II M I : M M I I I M I I I I I II I I I I I I M I I M M I I I : M I : I I I I I I 

Db 680 KRLKLAI FGPLSCSSLEYSIRVYCLDDTQDALKEVLQLERQMGGQLLEEPKTLHFKGSTH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 

II I I I I I I : I I II I I I I I I I I I I II I M II I I II I I I II I M M III I 

Db 740 NLRLSIHDIAHSLWKSKLPAKYQEIPFYHIWSGCQRNLHCTFTLERFSLNTLELVCKLCV 799 



Qy 767 WQVEGDGQSFSINFNITKDTRFAELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPP 826 

I I I I : I I I : I : : : : : : : : I : : I I I : I I I I I I I I : I I I I I 
Db 800 RQVEGEGQI FQLNCSVSEEPTGI DYPIMDSAGSITTIVGPNAFS I PLPI RQKLCSSLDAP 859 

Qy 827 CRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQP 886 

II III II II II : I : : I I : I III : I I : I I I I : : I I : I I I I III : : I : 
Db 860 QTRGHDWRMLAHKLKLDRYLNYFATKSSPTGVTLDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSE 895 

: : : I 
Db 920 ETWSLAAE 928 

RESULT 7 
UN5C_HUMAN 

ID UN5C_HUMAN STANDARD; PRT; 931 AA. 

AC 095185; Q8IUT0; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5C precursor (Unc-5 homolog C) (Unc-5 homolog 3) . 

GN Name=UNC5C; Synonyms=UNC5H3 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RC TISSUE=Brain; 

RX MEDLINE=99000841; PubMed=9782087 ; DOI=10 . 1006/geno . 1998 . 5425 ; 

RA Ackerman S.L., Knowles B.B.; 

RT "Cloning and mapping of the UNC5C gene to human chromosome 4q21-q23."; 

RL Genomics 52:205-208(1998). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC TISSUE=Lung; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A. , Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM. , Schuler G.D., 

RA Altschul S.F., Zeeberg B. , Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F. , 

RA Diatchenko L., Marusina K., Farmer A. A. , Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S . S . , . Loquellano N.A., Peters G.J., Abramson R.D., Mullahy.. S . J. , 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A. , Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M. , Madan A., Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 



RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [3] 

RP DOWN- REGULATION IN CANCER. 

RX PubMed=12655055; DOI=10 . 1073/pnas . 0738063100; 

RA Thiebault K., Mazelin L., Pays L., Llambi F. , Joly M.-O., 

RA Scoazec J.-Y., Saurin J.-C, Romeo G., Mehlen P.; 

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. Also involved in corticospinal tract axon guidances 

CC independently of DCC. It also acts as a dependence receptor 

CC required for apoptosis induction when not associated with netrin 

CC ligand (By similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC Isold=095185-1; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=095185-2; Sequence=VSP_011700, VSP_011701; 

CC -!- TISSUE SPECIFICITY: Mainly expressed in brain. Also expressed in 

CC kidney. Not expressed in developing or adult lung. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues. 

CC Phosphorylation of Tyr-568 leads to an interaction with PTPN11 

CC phosphatase, suggesting that its activity is regulated by 

CC phosphorylation/dephosphorylation. Tyrosine phosphorylation is 

CC netrin-dependent (By similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- MISCELLANEOUS: Down- regulated in multiple cancers including 
CC colorectal, breast, ovary, uterus, stomach, lung, or kidney 

CC cancers . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC - ! - SIMILARITY: Contains 1 immunoglobulin- like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF055634; AAC67491.1; -. 

DR EMBL; BC041156; AAH41156.1; 



DR HSSP; P07996; 1LSL. 

DR Genew; HGNC: 12569; UNC5C. 

DR MIM; 603610; -. 

DR GO; GO: 0005042; Frnetrin receptor activity; TAS. 

DR GO; GO:0007411; P:axon guidance; TAS. 

DR GO; GO:0007420; P:brain development; TAS. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN ; FALSE_NEG. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Alternative splicing; Apoptosis; Developmental protein; 

KW Immunoglobulin domain; Phosphorylation; Polymorphism; Receptor; 



KW 


Repeat; Signal; 


Transmembrane . 


FT 


SIGNAL 


1 


40 


Potential . 


FT 


CHAIN 


41 


931 


Netrin receptor UNC5C. 


FT 


DOMAIN 


41 


380 


Extracellular (Potential) . 


FT 


TRANS MEM 


381 


401 


Potential . 


FT 


DOMAIN 


402 


931 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


62 


159 


Ig-like. 


FT 


DOMAIN 


161 


256 


Ig-like C2-type. 


FT 


DOMAIN 


260 


314 


TSP type-1 1. 


FT 


DOMAIN 


316 


368 


TSP type-1 2. 


FT 


DOMAIN 


528 


631 


ZU5 . 


FT 


DOMAIN 


850 


929 


Death. 


FT 


SITE 


415 


416 


Cleavage (by caspase-3) (By similarity) 


FT 


SITE 


694 


712 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


83 


142 


By similarity. 


FT 


DISULFID 


188 


239 


By similarity. 


FT 


MOD RES 


568 


568 


Phosphotyrosine (By similarity) . 


FT 


CARBOHYD 


236 


236 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


361 


361 


N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


370 


370 


T -> SFIYPISTEQRTQNEYGFSS (in isoform 2 


FT 








/FTId=VSP_011700. 


FT 


VARSPLIC 


579 


931 


Missing (in isoform 2) . 


FT 








/FTId=VSP 011701. 


FT 


VARIANT 


37 


37 


G -> V (in dbSNP:2306715) . 


FT 








/FTId=VAR 019731. 


FT 


VARIANT 


721 


721 


T -> M (in dbSNP:2289043) . 


FT 








/FTId=VAR_019732. 


FT 


CONFLICT 


219 


219 


T -> I (in Ref . 1) . 


FT 


CONFLICT 


489 


489 


S -> T (in Ref. 1) . 


SQ 


SEQUENCE 


931 AA; 103101 


MW; EFD71122C98DABB8 CRC64; 



Query Match 57.5%; Score 2755; DB 1; Length 931; 

Best Local Similarity 56.4%; Pred. No. 2.2e-198; 

Matches 514; Conservative 154; Mismatches 215; Indels 28; Gaps 



9; 



Qy 9 PALLGIVLAAWLRGSGAQQS ATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLLVC 65 

III : I : I I I I I : I I : I I I I I : I I I : I I I I I I I I I \ 

Db 26 PAL--ALLSASGTGSAAQDDDFFHELPETFPSDPPEPLPHFLIEPEEAYIVKNKPVNLYC 83 

Qy 66 KAVPATQI FFKCNGEWVRQVDHVT ERSTDGS SGLPTMEVRINVS RQQVEKVFGLEEYWCQ 125 

II I I I I I : I II I III I II::: I : I I I II I : I I I I I I : : I I I : I I I I 

Db 84 KASPATQIYFKCNSEWV7iQKDHIVI)ERvI)ETSGLI VT^EVSIEISRQQVEELFGPEDYWCQ 143 

Qy 126 CVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEWLR 185 

I I I I I I : II I II : I II : I I I III I I I I I I I I I I I I I : : I I I I I I I I I MINI: 
Db 144 CVAWSSAGTTKSRKAYVRIAYLRKTFEQEPLGKEVSLEQEVLLQCRPPEGIPVAEVEWLK 203 

Qy 186 NEDLVD P S LD PNVYI T REH S LWRQARLADTAN YT C VAKN I VARRRS AS AAVI VYVNGGW 245 

I I I : : I I I I III : I : I : : : I I I I : I I I I I I I I I I I I II : I : I : I I I I I I I I I I 
Db 204 NEDIIDPVEDRNFYITIDHNLIIKQARLSDTA^ 263 

Qy 246 STWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSPWS 305 

I I I I I I I I I : : I I I I : I I I : I : I I I I I I I I I I I I I I II : I I I II I I II I I I hill 
Db 264 STWTEWSVCNSRCGRGYQKRTRTCTNPAPLNGGAFCEGQSVQKIACTTLCPVDGRWTPWS 323 

Qy 306 KWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHSASGPEDVALYVGL 365 

III II : I I I I I III: I I I : I I I : : I I I : : I I I II: : I : I I I I I I I : 

Db 324 KWSTCGTECTHWRRRECTAPAPKNGGKDCDGLVLQSKNCTDGLCMQTAPDSDDVALYVGI 383 

Qy 366 -IAVAVCLVLLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPSKADNPHLLTIQP 424 

I I I I I I : : : I : I I : : I I : I I I I I I I I I : I I : : I I I : I 
Db 384 VI AVT VC LAI S VWAL FVYRKNH RD FE S D 1 1 D S S ALN GG FQ P VN I KAARQ D LLAVPP 440 

Qy 425 DLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLHHS SPTSEAEEFVS 480 

II:: I : I : I II Ml : I I I : :::: II : III 
Db 441 DLTSAAAMYRGPVYALHD-VSDKIPMTNSPILDPLPNLKIKVYNTSGAVSPQDDLSEFTS 499 

Qy 481 RLS TQNYF RSLPRGT — SNMTYGTFNFLGGRLMIPNTGISLLIPPDAI 526 

: I I II: : I I I I I : I : I I I I I I : : i I : I : I I I I I II 

Db 500 KLSPQMTQSLLENEALSLKNQSLARQTDPSCTAFGSFNSLGGHLIVPNSGVSLLIPAGAI 559 

Qy 527 PRGKIYEIYLTLHKPEDVTUiPLAGCQTLLSPIVSCGPPGVXLTRPVILAMDHCGEPSPDS 586 

I : I : : I I : I : I : I : I : I I : II I I : I : I I I I II I II I I I I : I I I I : I : : 
Db 560 PQGRWEMYVTVHRKETMRPPMDDSQTLLTPWSCGPPGALLT 619 

Qy 587 WSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAA 646 

I : I I I : : I I I I I : : I I I : I : I : I I I :: I I I : I I I I : : I I I 
Db 620 WKILLKNQAAQGQWEDWWGEENFTTPCYIKLDAEACHILTENLSTYALVGHSTTKAAA 679 

Qy 647 KRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVXHFKDSYH 706 

I I I I I : I I : I : I I I I : I I I I I I II INN:: MM I I I I :: I I : MM I I 
Db 680 KRLKLAIFGPLCCSSLEYSIRVYCLDDTQDALKEILHLERQTGGQLLEEPKALHFKGSTH 739 

Qy 707 NLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWV 766 

I I I M I I I : I I I II II I II M II I I M M : I I I I I I II I I I I : I M III I 
Db 740 NLRLS I HDI AHS LWKSKLLAKYQEI PFYHVWS GSQRNLHCT FTLERFS LNTVELVCKLCV 799 



Qy 767 WQVEGDGQS FS INFNITKDTRFAELLALESEAGWALVGPSAFKI PFLIRQKI I SSLDPP 826 

Nihil l' M :::: :| |: : : Mill II INI: I I I I I 
Db 800 RQVEGEGQI FQLNCTVSEEPTGIDLPLLDPANTITTVTGPSAFS I PLPI RQKLCSSLDAP 859 

Qy 827 CRRGADWRTLAQKLHLDSHLS FFAS KPS PTAMI LNLWEARHFPNGNLSQLAAAVAGLGQP 886 

II Ml II M : I I :|::MM Ml : M : I I M : : II : II I I III : :|: 
Db 860 QTRGHDWRMLAHKLNLDRYLNYFATKSSPTGVILDLWEAQNFPDGNLSMLAAVLEEMGRH 919 

Qy 887 DAGLFTVSEAE 897 

Db 920 ETWS LAAEGQ 930 



RESULT 8 
UN5B XENLA 



ID UN5B_XENLA STANDARD; PRT; 943 AA. 

AC Q8JGT4; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (UNC-5 homolog) (Protein XUNC-5) . 

OS Xenopus laevis (African clawed frog) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Amphibia; Batrachia; Anura; Mesobatrachia; Pipoidea; Pipidae; 

OC Xenopodinae; Xenopus. 

OX NCBI_TaxID=8355 ; 

RN [1] 

RP SEQUENCE FROM N.A. , AND TISSUE SPECIFICITY. 

RX MEDLINE=22239703; PubMed=12351179; DOI=10 . 1016/S0925-4773 ( 02 ) 00215-0; 

RA Anderson R.B., Holt C.E.; 

RT "Expression of UNC-5 in the developing Xenopus visual system."; 

RL Mech. Dev. 118:157-160(2002). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding (By similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: In the developing visual system, it is 

CC expressed within the developing optic vesicles and later become 

CC restricted to the dorsal ciliary marginal zone, a site of 

CC retinoblast proliferation and differentiation. 

CC PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. ' 

CC ,_-!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC • 



DR EMBL; AY099459; AAM34486.1; -. 

DR HSSP; P07996; 1LSL. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART ; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Developmental protein; Immunoglobulin domain; Phosphorylation; 



KW 


Receptor; 


Repeat; 


Signal; 


Transmembrane. 


FT 


SIGNAL 


1 


30 


Potential. 


FT 


CHAIN 


31 


943 


Netrin receptor UNC5B. 


FT 


DOMAIN 


31 


380 


Extracellular (Potential) . 


FT 


TRANSMEM 


381 


401 


Potential . 


FT 


DOMAIN 


402 


943 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


51 


148 


Ig-like. 


FT 


DOMAIN 


150 


245 


Ig-like C2-type. 


FT 


DOMAIN 


249 


303 


TSP type-1 1. 


FT 


DOMAIN 


305 


357 


TSP type-1 2. 


FT 


DOMAIN 


540 


643 


ZU5. 


FT 


DOMAIN 


863 


941 


Death. 


FT 


DISULFID 


72 


131 


By similarity. 


FT 


DISULFID 


177 


228 


By similarity. 


FT 


CARBOHYD 


225 


225 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


350 


350 


N-linked (GlcNAc. . .) (Potential) 


SQ 


SEQUENCE 


943 AA; 


105083 


MW; A024E24A7EDB6175 CRC64; 



Query Match 55.2%; Score 2646.5; DB 1; Length 943; 

Best Local Similarity 53.0%; Pred. No. 3.4e-190; 

Matches 496; Conservative 163; Mismatches 229; Indels 47; Gaps 8; 

Qy 10 ALLGIVLAAWLRG S GAQQ SAT VAN P VP GAN P DLL PH FLVEP EDVYI VKN KPVX 62 

-U-l : I I : : I : I : : II : I I I I I : I I I I -UN- 1 I I I I 

Db 10 AALAAILVALILSCNFPSSTAGIEYSDVLPDSFPSAPAESLPHFLLEPEDAYIVKNKPVE 69 



Qy 63 LVCKAVPATQIFFKCNGEWVRQVDHVIERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEY 122 

I I I I I I I I I I : I I I I I I I I I I I: : I :|| I I : I I I I I I I I :: I I I I : I 
Db 70 LVCKAN PATQ I YFKCNGEWVNQNDH I TKERVDDVTGLWREVQI EVS RQQVEELFGLED Y 129 

Qy 123 WCQCVAWS S S GTTKSQKAYI RI ARLRKNFEQEPLAKEVS LEQGI VLPCRPPEGI P PAEVE 182 

I I I I I I I I I : I I I I I ::: I : I I I I I I I I : I I I I I I I : I I I : I I I I I I I : I I II I I 
Db 130 WCQCVAWS SAGTTKSKRSYVRIAYLRKNFDQEPLGKEVALEQEALLQCRPPEGVP PAEVE 189 

Qy 183 WLRN EDLVD P S LD PNVYI TREH S LWRQARLADTAN YT CVAKN I VARRRS AS AAVI VYVN 242 



Db 


190 


Qy 


243 


Db 


250 


Qy 


303 


Db 


310 


Qy 


353 


Db 


370 


Qy 


410 


Db 


430 


Qy 


468 


Db 


489 


Qy 


504 


Db 


549 


Qy 


564 


Db 


609 


Qy 


624 


Db 


669 


Qy 


684 


Db 


729 


Qy 


744 


Db 


789 


Qy 


804 


Db 


849 


Qy 


864 


Db 


909 



I I II : I : I : : : I I I I : I I I I I I I I : | | | | | : | | | :| |||:|| 
WLKNEEIIDPTKDTNFLITIDHNLIIKQARLSDTANYTCVSKNIVAKRRSTTATVTVFVN 249 

GGWSTWT EWS VC S AS CGRGWQKRS RS CTN PAP LNGGAFC EGQNVQKT ACAT LC PVDGS WS 302 
I I I I : I I I I I I : II I I I I I : I : I I I I I I I I I I I I I I II I I I : I I I I I I : 
GGWSSWTEWSPCNNRCGHGWQKRTRTCTNPAPLNGGTMCEGQQYQKFACNTMCPVDGGWT 309 

PWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCVHS 352 

I I I I I I I : I I I I I I I I I : I I : I I I : : I I I I : : I I I I I : : 
EWS KWS ACSTECTHWRS RECNAPT P KNGGKDCS GMLLDS KNCTDGLCMQNKRVLGETKS R 3 6 9 

-ASGPEDVALYVGL- 1 AVAVCLVLLLLVLI LVYCRKKEGLDS DVADS S - 1 LTSGFQPVS I 409 

I I I I I I I :|: : ::| I: I |:| I I MM III II II II: 

LLESTGDVALYAGLVVAIFIVIILLMAVGIVVYRRNCREFDTDITDSSAALTGGFHPVNF 429 

KPSKADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLGGGRHTLH 4 67 
I I : I I I : : I I I I : I : I : : II : I : I I I I I I : : : 

KTSRHDNSQLIHPAMQPDLTANAGIYRGNMYALQDS-ADKIPMTNSPLLDPLPNLKIKVY 488 

HSS PTSEAEEFVSRLSTQN YFRSLPRGTSNMTYGTF 503 

: II II : : : : | : : I I I : I I III 

NSSTVGSSPGIHDGNNLLGTKPTGTYPSDNNIMNARNKNMSMQHLLTLPRDSSNSVTGTF 548 

NFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSPIVSCGP 563 

I I I I I 1111:11111 I I I : I I I I : I I : : I I : II I I I : I I I I : : I I I 
GSLGGRLTFPNTGVSLLIPQGAIPQGKYYEMYLMINKRENTVLPSEGTQTILSPIITCGP 608 

PGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYCQLEASA 623 

I : I I : I I I I : I I : : I I : I I I I : I : I I : I : I I I : Mill:: 
TGLLLCKPVT LTVPHCADINTSDWILQLKTQSHQGNWEEWTLNEETLNTPCYCQLESHS 668 

CYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDALKEWQ 683 
I : MM : I III: I : I I I I : I : I I I : I I I I I I I : : I I I : II I I I I I I : : 
CHTLLDQLGTYAFVGESYSRSAIKRLQLAIFAPMLCTSLEYNLKVYCVEDTPDALKEVLE .728 

LEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRY 743 
III III |::M::I I I I I I II I I I I I I I : I I I I : I I I : I I I I I I I I I I : I : I I 
LEKTLGGYLVEEPKLLMFKDSYHNLRLSIHDIPHSLWRSKLMAKYQEIPFYHIWSGSQRT 788 

LHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESEAGVPAL 803 

I I I I I I I I I I : : : I I I : I I I II : I I I : : : : : : : : : I I 
LHCTFTLERYSLAATELTCKICWQVEGEGQIFQLHTLLEENVKSFDPFCSQAENSVTTH 848 

VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILNLW 863 

:M MINI Mill MM I II III Mill :| :|::|IM III MIMI 

LGP YAFKI P FS I RQKI CNS LDAPNSRGNDWRLLAQKLCMDRYLN YFATKAS PTGVT LDLW 908 

EARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

II I : I : I : II : I : : I : : I :: : | 
EALHQDDGDLNTLASALEEMGKSEMMLVMATDGDC 943 



RESULT 9 
UN5B_MOUSE 

ID UN5B_MOUSE STANDARD; PRT; 945 AA. 

AC Q8K1S3; Q6PFH0; Q80Y85; Q9D398; 
DT 25-OCT-2004 (Rel. 45, Created) 



DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) . 

GN Name=Unc5b; Synonyms =Unc5h2 ; 

OS Mus musculus (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Mus. 

OX NCBI_TaxID=10090; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX MEDLINE=22239710; PubMed=12351186; DOI=10 . 1016/S0925-4773 ( 02 ) 00248-4 ; 

RA Engelkamp D . ; 

RT "Cloning of three mouse unc-5 genes and their expression patterns at 

RT mid-gestation . " ; 

RL Mech. Dev. 118:191-197(2002). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 1) . 

RC STRAIN=C57BL/6J; TISSUE=Medulla oblongata; 

RX MEDLINE=22354683; PubMed=124 66851; DOI=10 . 1038/nature01266; 

RA Okazaki Y. , Furuno M. , Kasukawa T., Adachi J., Bono H., Kondo S., 

RA Nikaido I., Osato N., Saito R. , Suzuki H., Yamanaka I., Kiyosawa H., 

RA Yagi K., Tomaru Y., Hasegawa Y., Nogami A., Schonbach C, Gojobori T., 

RA Baldarelli R. , Hill D.P., Bult C, Hume D.A., Quackenbush J., 

RA Schriml L.M., Kanapin A., Matsuda H., Batalov S., Beisel K.W., 

RA Blake J. A., Bradt D., Brusic V. , Chothia C, Corbani L.E., Cousins S., 

RA Dalla E., Dragani T.A. , Fletcher C.F., Forrest A., Frazer K.S., 

RA Gaasterland T., Gariboldi M. , Gissi C, Godzik A., Gough J. , 

RA Grimmond S., Gustincich S., Hirokawa N., Jackson I.J., Jarvis E.D., 

RA Kanai A., Kawaji H., Kawasawa Y., Kedzierski R.M., King B.L., 

RA Konagaya A., Kurochkin I.V. , Lee Y. , Lenhard B., Lyons P. A., 

RA Maglott D.R., Maltais L., Marchionni L., McKenzie L., Miki H., 

RA Nagashima T., Numata K. , Okido T., Pavan W.J., Per tea G., Pesole G. , 

RA Petrovsky N., Pillai R., Pontius J.U., Qi D., Ramachandran S., 

RA Ravasi T., Reed J.C., Reed D.J., Reid J., Ring B.Z., Ringwald M. , 

RA Sandelin A., Schneider C, Semple C.A. , Setou M. , Shimada K., 

RA Sultana R. , Takenaka Y. , Taylor M.S., Teasdale R. D. , Tomita M. , 

RA Verardo R. , Wagner L., Wahlestedt C, Wang Y., Watanabe Y. f Wells C, 

RA Wilming L.G., Wynshaw-Boris A. , Yanagisawa M. , Yang I., Yang L., 

RA Yuan Z., Zavolan M. , Zhu Y., Zimmer A., Carninci P., Hayatsu N., 

RA Hirozane-Kishikawa T., Konno H. f Nakamura M. , Sakazume N., Sato K. , 

RA Shiraki T., Waki K. , Kawai J. , Aizawa K w Arakawa T. , Fukuda S., 

RA Hara A., Hashizume W., Imotani K., Ishii Y., Itoh M. , Kagawa I., 

RA Miyazaki A., Sakai K., Sasaki D., Shibata K. , Shinagawa A., 

RA Yasunishi A. , Yoshino M. , Waterston R. , Lander E.S. f Rogers J., 

RA Birney E . , Hayashizaki Y. ; 

RT "Analysis of the mouse transcriptome based on functional annotation of 

RT 60,770 full-length cDNAs . " ; 

RL Nature 420:563-573(2002). 

RN [3] 

RP SEQUENCE FROM N.A. (ISOFORM 2) . 

RC STRAIN=C57BL/6; TISSUE=Brain; 

RX MEDLINE=22388257; PubMed=12477932 ; DOI=10 . 1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L., Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L., Marusina K. , Farmer A. A., Rubin G.M., Hong L., 



RA Stapleton M., Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R. D. , Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M.,. Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A. , 

RA Fahey J., Helton E., Ketteman M., Madan A. , Rodrigues S., Sanchez A., 

RA Whiting M. , Madan A. , Young A.C., Shevchenko Y. , Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M. , 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A. , Schein J.E., Jones S.J.M., Marra M.A. ; 

RT "Generation and initial analysis of more than 15,000 full-length human 

RT and mouse cDNA sequences."; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:16899-16903(2002). 

RN [4] 

RP TISSUE SPECIFICITY. 

RX PubMed=12 7 99072; 

RA Dalvin S., Anselmo M.A. , Prodhan P . , Komatsuzaki K., Schnitzer J. J., 

RA Kinane T . B . ; 

RT "Expression of Netrin-1 and its two receptors DCC and UNC5H2 in the 

RT developing mouse lung."; 

RL Gene Expr. Patterns 3:279-283(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand (By 

CC similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC. Interacts 
CC with GNAI2 via its cytoplasmic part (By similarity). 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q8KlS3-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q8KlS3-2; Sequence=VSP_011699; 

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Expressed in lung 
CC during late development. Expressed during early blood vessel 

CC formation, in the semicircular canal and in a dorsal to ventral 

CC gradient in the retina. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis (By similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 



cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 
KW 



between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib. ch) . 

EMBL; AJ487853; CAD32251.1; -. 
EMBL; AK018177; BAB31108.1; 

BC048162; AAH48162.1; ALT_INIT. 
BC057560; AAH57560.1; 
P07996; 1LSL. 
MGD; MGI: 894703; UncSb. 
InterPro; IPR000488; Death. 

IPR011029; DEATH_like. 
IPR007110; Ig-like. 
IPR003598; Ig_c2. 
IPR000884; TSP1. 
IPR008085; TSP_1. 
IPR000906; ZU5. 
Pfam; PF00531; Death; 1. 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 
SMART; SM00218; ZU5; 1. 
PROSITE; PS50017; DEATHJDOMAIN; 1. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 
Transmembrane . 



EMBL; 
EMBL; 
HSSP; 



InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 



FT 


SIGNAL 


1 


26 


Potential . 


FT 


CHAIN 


27 


945 


Netrin receptor UNC5B. 


FT 


DOMAIN 


27 


377 


Extracellular (Potential) . 


FT 


TRANSMEM 


378 


398 


Potential . 


FT 


DOMAIN 


399 


945 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


48 


145 


Ig-like. 


FT 


DOMAIN 


153 


242 


Ig-like C2-type. 


FT 


DOMAIN 


246 


' 300 


TSP type-1 1. 


FT 


DOMAIN 


302 


354 


TSP type-1 2. 


FT 


DOMAIN 


541 


644 


ZU5 . 


FT 


DOMAIN 


865 


94.3,_ 


Death. 


FT 


SITE 


412 


413 


Cleavage (by caspase-3) (By similarity) 


FT 


SITE 


707 


725 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


69 


128 


By similarity. 


FT 


DISULFID 


174 


225 


By similarity. 


FT 


CARBOHYD 


222 


222 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


347 


347 


N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


356 


367 


NQRTLNDPKSHP -> T (in isoform 2). 


FT 








/FTId=VSP_011699. 


FT 


CONFLICT 


238 


238 


T -> A (in Ref . 2) . 


FT 


CONFLICT 


394 


394 


V -> E (in Ref. 2) . 


FT 


CONFLICT 


679 


679 


T -> S (in Ref. 2) . 



FT CONFLICT 874 874 N -> D (in Ref. 2). 

SQ SEQUENCE 945 AA; 103738 MW; 80E896F0F0E06012 CRC64; 



Query Match 53.8%; Score 2578.5; DB 1; Length 945; 

Best Local Similarity 53.2%; Pred. No. 4.5e-185; 

Matches 506; Conservative 150; Mismatches 235; Indels 61; Gaps 15; 

Qy 1 MAVRPGLWPALLGIVLAAW LRG — S GAQQ S ATVAN P VP GAN P DLL P H FLVE P E DV 53 

I II: III : I I II I I : : II : I I : I I : I I : I 
Db 1 MRARSGVRSALLLALLLCWDPTPSLAGVDSAGQ VLPDSYPSAPAEQLPYFLLEPQDA 57 

Qy 54 YIV7CNKPVXLVCKAVPATQIFFKCNGEWW 113 

MINIM I Ml 11111:11111111 I IN : I I ::U NM MINI 
Db 58 YIVKNKPVELHCRAFPATQIYFKCNGEWVSQNDHVTQESLDEATGLRVREVQIEVSRQQV 117 

Qy 114 EKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPP 173 

MM I I Ml I I I I I I I I I I I I I I MM I I I I I I 1 I II : I I I I I I E I |: :M I II I 
Db 118 EELFGLEDYWCQCVAWS S SGTTKS RRAYI RI AYLRKNFDQEPLAKEVPLDHEVLLQCRPP 177 

Qy 174 EGIPPAEV^WLRNEDLVT)PSLDPNWITREHSLVVRQARIJ\DTANYTCVA 233 

NM I I I I M M I M M I : I I : I M M : M I I II M I I I I I I I I II II I M I I 
Db 178 EGVP VAEVEWL KN E DVI D P AQ DTN FL LT I DHN L 1 1 RQARL S DTAN YT CVAKN I VAKRRST 237 

Qy 234 SAAVIVYWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACAT 293 

M I I I I I I I I I I M III II I II II I I I M M I I I II I I I I I I I II I I I I I I I 
Db 238 TATVIVYVNGGWSSWAEWSPCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQAFQKTACTT 297 

Qy 294 LCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV 350 

M M I I : I : I i I I I I I :| I I I I I I I I Mill M II I I : : ! I I III 
Db 298 VCPVT)GAWTEWSKWSACSTECAHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCVLNQ 357 

Qy 351 HS AS GP EDVAL YVGL- 1 AVAVCLVLLLLVL I LVYCRKKEGLDS DVADS S - 1 L 400 

I I II I I II M I I : M : I :: I I I MM III I 

Db 358 RTLND P KS H P LET S GDVAL YAGL VVAVFWVAVLMAVGVI VYRRNCRD FDT D I T D S S AAL 417 

Qy 401 T S GFQ P VS I KP S KADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSP 458 

I II I M I : : M I II : MM: I : I : | | : | : | | Ml 
Db 418 TGGFHPVNFKTARPNNPQLLHPSAPPDLTASAGIYRGPVYALQDS-ADKIPMTNSPLLDP 476 

Qy 459 LGGGRHTLHHSS PTSEAEEFVSRLSTQNYFRS 490 

I : : : M I I : Ml): : I I 

Db 477 LPSLKIKVYNSSTIGSGSGLADGADLLGVLPPGTYPGDF-SRDTHFLHLRSASLGSQHLL 535 

Qy 491 -LPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA 549 

I II M III I I II I M I I : I I M I I I I : I I I : : 1 | : : | I | | | : 
Db 536 GLPRDPSSSVSGTFGCLGGRLSLP.GTGVSLLVPNGAIPQGKFYDLYLHINKAEST-LPLS 594 

Qy 550 -GCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGE 608 

I I I : II I I : I I I M I I II I : I : II I I Ml M M MM: | | 

Db 595 EGSQTvTiSPSVT-CGPTGLLLCRPVA^TVPHCAEVIAGDWIFQLKTQAHQGHWEEWTLDE 654 

Qy 609 EAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRV 668 

I : MINIM:: MM: MM I M I I M I MM I I II I M : I I 
Db 655 ETLNTPCYCQLEAKSCHILLDQLGTYVFMGESYSRSAVKRLQLAIFAPALCTSLEYSLRV 714 

Qy 669 YCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSY 728 

III II I I I I I : : I I : III MMM I II II I I II I I M I M I : M I I I I I 



Db 



715 YCLEDTPVALKEVLELERTLGGYLVEEPKPLLFKDSYHNLRLSLHDIPHAHWRSKLLAKY 774 



Qy 729 QEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRF 788 

I I I I I I I : I I I : I I I II II MM I :::: I I : I Nihil I : : : :| 

Db 775 QEIPFYHVWNGSQRALHCTFTLERHSLASTEFTCKVCVRQVEGEGQIFQLHTTLA-ETPA 833 

Qy 789 AELLALESEAG — VPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHL 846 

I II I I : Ml Mill Mill MM I II III Mill M M 

Db 834 GSLDALCSAPGNAITTQLGPYAFKIPLSIRQKICSSLDAPNSRGNDWRLLAQKLSMDRYL 893 

Qy 847 SFFASKPSPTAMI LNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 

:: I I :| I I I :| I :| I I I I MM: MM: : | : : : :: :| 

Db 894 NYFATKAS PTGVI LDLWEARQQDDGDLNS LASALEEMGKS EMLVAMATDGDC 945 

RESULT 10 
UN5B_RAT 

ID UN5B_RAT STANDARD; PRT; 945 AA. 

AC 008722; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) . 

GN Name=Unc5b; Synonyms=Unc5h2 ; 

OS Rattus norvegicus (Rat) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus. 

OX NCBI_TaxID=10116; 

RN [1] 

RP SEQUENCE FROM N.A., FUNCTION, SUBCELLULAR LOCATION, AND TISSUE 

RP SPECIFICITY. 

RX MEDLINE=97271897; PubMed=9126742 ; 

RA Leonardo E.D., Hinck L. , Masu M. , Keino-Masu K. , Ackerman S.L., 

RA Tessier-Lavigne M. ; 

RT "Vertebrate homologues of C. elegans UNC-5 are candidate netrin 

RT receptors."; 

RL Nature 386:833-838(1997). 

RN [2] 

RP FUNCTION, AND INTERACTION WITH DCC. 

RX PubMed=10399920; 

RA Hong K. , Hinck L., Nishiyama M. , Poo M.-M., Tessier-Lavigne M. , 

RA Stein E. ; 

RT "A ligand-gated association between cytoplasmic domains of UNC5 and 

RT DCC family receptors converts netrin-induced growth cone attraction to 

RT repulsion."; 

RL Cell 97:927-941 (1999) . 

RN [3] 

RP FUNCTION, AND MUTAGENESIS OF ASP-412. 

RX PubMed=11387206; DOI=10 . 1093/embo j /20 . 11 . 2715; 

RA Llambi F., Causeret F. , Bloch-Gallego E., Mehlen P.; 

RT "Netrin-1 acts as a survival factor via its receptors UNC5H and DCC"; 

RL EMBO J. 20:2715-2722(2001). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 

CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 



CC apoptosis induction when not associated with netrin ligand. 

CC SUBUNIT: Interacts with GNAI2 via its cytoplasmic part (By 

CC similarity) . Interacts with the cytoplasmic part of DCC. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein. 

CC -!- TISSUE SPECIFICITY: Mainly expressed in regions of differentiating 

CC neurons. Expressed in the developing sensory ganglia that flank 

CC the spinal cord from E12, peaking at E14. Expressed in the roof 

CC plate region of the spinal cord from E14. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; U87306; AAB57679.1; -. 

DR HSSP; P07996; 1LSL. 

DR RGD; 621756; Unc5h2 . 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR .SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Apoptosis; Developmental protein; Immunoglobulin domain; 

KW Phosphorylation; Receptor; Repeat; Signal; Transmembrane. 

FT SIGNAL 1 26 Potential. 

FT CHAIN 27 945 Netrin receptor UNC5B. 

FT DOMAIN 27 377 Extracellular (Potential). 

FT TRANSMEM 378 398 Potential. 



FT 


DOMAIN 


399 


945 


*-y i-op±asnu.c irotenciaij . 




DOMATN 


4ft 


-L 1 J 


Trr — 1 i to 


FT 

J. A 


DOMATN 


1 S3 


?4? 


J.g iiKe L/Z:— uype . 


FT 


DOMAIN 


246 


300 


TSP tvr>e-l 1 


FT 


DOMAIN 


Jul. 


3S4 
«j j t 


ioc type i ^. 


FT 


DOMAIN 


541 


64 4 

\J*± *± 


uUJ. 


FT 


DOMAIN 

X^ V-/ i. Ail 1 J. H 


865 


943 




FT 


SITE 


412 


413 

T JL O 


Cleavage \Dy caspase o; • 


FT 


SITE 


707 


7? S 


lnceiacuion wicn li^l>. 


FT 


DISULFID 


69 


128 


uy 0 _i_ i i l j_ xa j l i— y • 


FT 


DISULFID 


174 


225 


R\7 qi mi 1 a ri 

Lj y 0 XilL-L Ictll Uy • 


FT 


CARBOHYD 

V**X XX \ i_/ N-^ X A X X-^ 


222 


222 


"NT— "1 -j wVpH / f^l rMttr 1 \ / _ -1 \ 
in xxhjvcu \ vjx oiN-rvt— • • . ) ^ rouentlai, ) • 


FT 


CARBOHYD 


347 


347 


N-linked (GlcNAc. . . ) (Potential). 


FT 


MUTAGEN 


412 


412 


D->N: Abolishes cleavage by caspase-3 and 


FT 








subsequent induction of apoptosis . 


SQ 


SEQUENCE 


945 AA; 


103520 


MW; 6E9C2A262E560B9B CRC64 ; 



Query Match 53.8%; Score 2578.5; DB 1; Length 945; 

Best Local Similarity 53.0%; Pred. No. 4.5e-185; 

Matches 509; Conservative 142; Mismatches 231; Indels 79; Gaps 17; 

MAVRPGLWPALLGIVLAAW LRG — S GAQQ SAT VAN P VPGAN P DLL PH FLVE P ED V 53 

I M Ml : I I II Ml : : II : I I I I I : I I I I 
MRARSGARGALLLALLLCWDPTPSLAGIDSGGQ ALPDSFPSAPAEQLPHFLLEPEDA 57 

YIVT<NKPvXLVCKAVTATQIFFKCNGEWV^QVT)HVIERSTDGSSGLPTMEWI 113 
I I I I I II I I I : I I I I I I : I I I I I I I I | II : I I ::|| ||:| MINI 
YI VKNKPVELHCRAFPATQI YFKCNGEWVSQKGHVTQES LDEATGLRI REVQI EVS RQQV 117 

EKVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPP 173 

hi III I II I It I I I I il::| III II I I I I I : I I I I I I I I I : ::| MM 
EELFGLEDYWCQCVAWSSSGTTKSRRAYIRIAYLRKNFDQEPLAKEVPLDHEVLLQCRPP 177 

EGIPPAEV^LRNEDLVIJPSLDPNWITREHSLvA^QARLADTANYTCV^^ 233 
MM I M I I I : I I I : : I I : I I : I : I : I : : I I I I I : I I I I I I I I I I I | I | : | | | 
EGVPVAEVEWLKNEDVIDPAQDTNFLLT I DHNLI I RQARLSDTANYTCVAKNI VAKRRST 237 

S AAVI VYVNGGW S TWT EW SVC S AS C GRGWQ KRS RS CTN PAP LNGGAFCEGQNVQ KT ACAT 293 
: I I I I I I I I I II : I III II I I I I I I I I : I : I I I I I I I I I I I I I I I I I I I I I I 
TATVI VYVNGGWS SWAEWS PCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQACQKTACTT 297 

LCPVDGSWSPWSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV 350 

: I I I I I : I : I M II II : I M II II I I Mill : I II I I : : I I I I I I 
VCPVI)GAWTEWSKWSACSTECAHWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCvIiNQ 357 

HSASGPE DVALYVGL- IAVAVCLVLLLLVLI LVYCRKKEGLDSDVADSS-I L 400 

: = I : I I I I I I I : I I I I : I : I : : I I I I : I : III I 

RTLNDPKSRPLEPSGDVALYAGLWAVFVVLAVlJyiAVGVIvTRRNCR 417 

T S GFQP VS I KP S KADN PHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSP 458 
llllhl:: I I I I : MM: | : | : | | : | Ml III 
TGGFHPVNFKTARPSNPQLLHPSAPPDLTASAGIYRGPVYALQDS-ADKIPMTNSPLLDP 476 

L GGG RHTLHHSSPTSEAEEFVS 480 

I M I I I I : 



Qy 


1 


Db 


1 


Qy 


54 


Db 


58 


Qy 


114 


Db 


118 


Qy 


174 


Db 


178 


Qy 


234 


Db 


238 


Qy 


294 


Db 


298 


Qy 


351 


Db 


358 


Qy 


401 


Db 


418 


Qy 


459 


Db 


477 



Qy 

Db 



481 RLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMI PNTGISLLIPPDAIPRGKIYEIYLTLHK 540 

I :| : Ml I: III I I I I I I I I I : I I I : I I I I : I I I : : I I : : I 
52 8 S LGSQ-HLLGLPRDPS S S VSGTFGCLGGRLT I PGTGVSLLVPNGAI PQGKFYDLYLRINK 586 



Qy 541 PEDVRLPLA-GCQTLLSPIVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGS 599 

I I I I : I I I : I I I I : I I I I : I I I I I : I : I I I I : I I I : : I 

Db 587 TEST-LPLSEGSQTVLSPSVTCGPTGLLLCRPVVLTVPHCAEVIAGDWIFQLKTQAHQGH 645 

Qy 600 WEDVLHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVAC 659 

l|:|: I II : MINI :|:: :|ll : I I : I : I I I I : I : I I I I 
Db 646 WEEWTLDEETLNT PC YCQLEAKS CHI LLDQLGT YVFTGES YS RSAVKRLQLAI FAPALC 705 

Qy 660 TSLEYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSL 719 

I I I I I : : I I II I II I I I I I : : I I : III I :: I I : I I I I I I I I I I I I : I I : I : 
Db 706 TSLEYSLRVYCLEDTPAALKEVLELERTLGGYLVEEPKTLLFKDSYHNLRLSLHDIPHAH 765 

Qy 720 WKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSIN 779 

I : II I I I I I I I II I : I I I : I : I I I I I I I I I I :::: I I : I Nihil I : : 
Db 766 WRSKLLAKYQEIPFYHVWNGSQKALHCTFTLERHSLASTEFTCKV^ 825 

Qy 780 FNITKDTRFAELLALESEAGVPAL — VGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLA 837 

: : I I I I I I I : I I I I I I I I I I I I : I I I I II I I I I I 
Db 826 TTLA-ETPAGSLDALCSAPGNAATTQLGPYAFKIPLSIRQKICNSLDAPNSRGNDWRLLA 884 

Qy 838 QKLHLDSHLSFFASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 897 

I I I : I : I : : I I : I I I I : I I : I I I I I : I : I : I I : I : : I : : : : : : 
Db 885 QKLSMDRYLNYFATKASPTGVILDLWEARQQDDGDLNSI^ALEEMGKSEMLVAMTTDGD 944 

Qy 898 C 898 

I 

Db 945 C 945 



RESULT 11 
UN5B_HUMAN 

ID UN5BJiUMAN STANDARD; PRT; 945 AA. 

AC Q8IZJ1; Q86SN3; Q8N1Y2; Q9H9F3; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5B precursor (Unc-5 homolog B) (Unc-5 homolog 2) 

DE (p53-regulated receptor for death and life protein 1) 

DE (UNQ1883/PR04326) . 

GN Name=UNC5B; Synonyms=P53RDLl, UNC5H2; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; .Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 2), TISSUE SPECIFICITY, AND INTERACTION 

RP WITH GNAI2. 

RC TISSUE=Lung; 

RX MEDLINE=22246081; PubMed=12359238 ; DOI=10 . 1016/S0006-291X ( 02 ) 02277-5; 

RA Komatsuzaki K. , Dalvin S. f Kinane T.B.; 

RT "Modulation of G(ialpha(2)) signaling by the axonal guidance molecule 

RT UNC5H2 . " ; 

RL Biochem. Biophys . Res. Commun. 297:898-905(2002). 



RN [2] 

RP SEQUENCE FROM N. A. (ISOFORM 1) , FUNCTION, AND MUTAGENESIS OF ASP-412. 

RX PubMed=12598906; DOI=10 . 1038/ncb943; 

RA Tanikawa C, Matsuda K. , Fukuda S., Nakamura Y., Arakawa H.; 

RT f, p53RDLl regulates of p53-dependent apoptosis . " ; 

RL Nat. Cell Biol. 5:216-223(2003). 

RN [3] 

RP SEQUENCE FROM N . A. (ISOFORM 1). 

RX MEDLINE=22887296; PubMed=12975309; DOI=10 . 1101/gr . 1293003; 

RA Clark H.F., Gurney A.L., Abaya E., Baker K. , Baldwin D., Brush J., 

RA Chen J. , Chow B., Chui C, Crowley C, Currell B., Deuel B., Dowd P., 

RA Eaton D., Foster J., Grimaldi C, Gu Q., Hass P.E., Heldens S., 

RA Huang A. , Kim H.S., Klimowski L., Jin Y. , Johnson S., Lee J. , 

RA Lewis L., Liao D. , Mark M. , Robbie E. , Sanchez C, Schoenfeld J., 

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A. , 

RA Vandlen R. , Watanabe C, Wieand D., Woods K., Xie M.-H., Yansura D., 

RA Yi S . , Yu G . , Yuan J. , Zhang M. , Zhang Z., Goddard A., Wood W.I., 

RA Godowski P., Gray A.; 

RT "The secreted protein discovery initiative (SPDI) , a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinformatics assessment."; 

RL Genome Res. 13:2265-2270(2003). 

RN [4] 

RP SEQUENCE OF 361-945 FROM N.A. 

RC TISSUE=Amygdala, and Teratocarcinoma; 

RX PubMed=14702039; DOI=10 . 1038/ngl285; 

RA Ota T., Suzuki Y., Nishikawa T., Otsuki T., Sugiyama T., Irie R., 

RA Wakamatsu A. , Hayashi K., Sato H., Nagai K., Kimura K. , Makita H., 

RA Sekine M. , Obayashi M. , Nishi T., Shibahara T., Tanaka T., Ishii S., 

RA Yamamoto J. -I., Saito K., Kawai Y., Isono Y., Nakamura Y. , 

RA Nagahari K., Murakami K., Yasuda T., Iwayanagi T., Wagatsuma M., 

RA Shiratori A. , Sudo H., Hosoiri T., Kaku Y. , Kodaira H., Kondo H., 

RA Sugawara M. , Takahashi M. , Kanda K. , Yokoi T., Furuya T. , Kikkawa E., 

RA Omura Y., Abe K. , Kamihara K. , Katsuta N. , Sato K., Tanikawa M., 

RA Yamazaki M. , Ninomiya K., Ishibashi T. , Yamashita H., Murakawa K. , 

RA Fujimori K. , Tanai H., Kimata M. , Watanabe M. , Hiraoka S., Chiba Y., 

RA Ishida S., Ono Y., Takiguchi S., Watanabe S., Yosida M., Hotuta T., 

RA Kusano J. , Kanehori K., Takahashi-Fujii A. , Hara H., Tanase T.-O., 

RA Nomura Y. , Togiya S., Komai F., Hara R. , Takeuchi K., Arita M. , 

RA Imose N., Musashino K., Yuuki H., Oshima A., Sasaki N., Aotsuka S., 

RA Yoshikawa Y. , Matsunawa H., Ichihara T., Shiohata N. , Sano S., 

RA Moriya S., Momiyama H., Satoh N., Takami S., Terashima Y., Suzuki O., 

RA Nakagawa S., Senoh A., Mizoguchi H., Goto Y., Shimizu F., Wakebe H., 

RA Hishigaki H., Watanabe T., Sugiyama A. , Takemoto M. , Kawakami B., 

RA Yamazaki M. , Watanabe K., Kumagai A. , Itakura S., Fukuzumi Y., 

RA Fujimori Y. , Komiyama M., Tashiro H., Tanigami A., Fujiwara T., 

RA Ono T., Yamada K. , Fujii Y., Ozaki K. , Hirao M. , Ohmori Y. , 

RA Kawabata A. , Hikiji T., Kobatake N., Inagaki H., Ikema Y., Okamoto S., 

RA Okitani R. , Kawakami T., Noguchi S., Itoh T., Shigeta K., Senba T., 

RA Matsumura K. , Nakajima Y. , Mizuno T., Morinaga M., Sasaki M. , 

RA Togashi T., Oyama M. , Hat a H., Watanabe M. , Komatsu T., 

RA Mizushima-Sugano J., Satoh T., Shirai Y., Takahashi Y. , Nakagawa K. , 

RA Okumura K., Nagase T., Nomura N. f Kikuchi H., Masuho Y. , Yamashita R. , 

RA Nakai K., Yada T., Nakamura Y., Ohara O., Isogai T., Sugano S.; 

RT "Complete sequencing and characterization of 21,243 full-length human 

RT cDNAs . " ; 

RL Nat. Genet. 36:40-45(2004). 



RN [5] 

RP DOWN-REGULATION IN CANCER. 

RX PubMed=12 655055; DOI=10 . 1073/pnas . 0738063100; 

RA Thiebault K. , Mazelin L., Pays L., Llambi F. , Joly M.-O., 

RA Scoazec J.-Y., Saurin J.-C, Romeo G., Mehlen P.; 

RT "The netrin-1 receptors UNC5H are putative tumor suppressors 

RT controlling cell death commitment."; 

RL Proc. Natl. Acad. Sci. U.S.A. 100:4173-4178(2003). 

CC -!- FUNCTION: Receptor for netrin required for axon guidance. Mediates 
CC axon repulsion of neuronal growth cones in the developing nervous 

CC system upon ligand binding. Axon repulsion in growth cones may be 

CC caused by its association with DCC that may trigger signaling for 

CC repulsion. It also acts as a dependence receptor required for 

CC apoptosis induction when not associated with netrin ligand. 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 
CC similarity) . Interacts with GNAI2 via its cytoplasmic part. 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l ; 

CC Isoid=Q8IZJl-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q8IZJl-2; Sequence=VSP_011698 ; 

CC -!- TISSUE SPECIFICITY: Highly expressed in brain. Also expressed at 
CC lower level in developing lung f cartilage, kidney and 

CC hematopoietic and immune tissues. 

CC -!- INDUCTION: By p53/TP53. 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 
CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to induce 

CC apoptosis. 

CC -!- MISCELLANEOUS: Down-regulated in multiple cancers including 
CC colorectal, breast, ovary, uterus, stomach, lung, or kidney 

CC cancers. 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC ; ■ 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. ..There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 



CC 

DR EMBL; AY126437; AAM95701.1; -. 

DR EMBL; AB096256; BAC57998.1; 

DR EMBL; AY358351; AAQ88717.1; 

DR EMBL; AK022859; BAB14276.1; ALT_INIT. 

DR EMBL; AK094595; BAC04382.1; ALT_INIT. 

DR HSSP; P07996; 1LSL. 



DR Genew; HGNC: 12568; UNC5B. 

DR MIM; 607870; -. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATHJ_ike. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2 . 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR008085; TSP_1. 

DR InterPro; IPR000906; ZU5. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PRINTS; PR01705; TSP1REPEAT. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; iGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR SMART; SM00218; ZU5; 1. 

DR PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 

DR PROSITE; PS50835; I G_LI KE ; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Alternative splicing; Apoptosis; Developmental protein; 

KW Immunoglobulin domain; Phosphorylation; Polymorphism; Receptor; 

KW Repeat; Signal; Transmembrane. 



FT 


SIGNAL 


1 


26 


Potential. 


FT 


CHAIN 


27 


945 


Netrin receptor UNC5B. 


FT 


DOMAIN 


27 


377 


Extracellular (Potential) . 


FT 


TRANSMEM 


378 


398 


Potential. 


FT 


DOMAIN 


399 


945 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


48 


145 


Ig-like . 


FT 


DOMAIN 


147 


242 


Ig-like C2-type. 


FT 


DOMAIN 


246 


300 


TSP type-1 1. 


FT 


DOMAIN 


302 


354 


TSP type-1 2. 


FT 


DOMAIN 


541 


644 


ZU5. 


FT 


DOMAIN 


865 


943 


Death. 


FT 


SITE 


412 


413 


Cleavage (by caspase-3) . 


FT 


SITE 


707 


725 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


69 


128 


By similarity. 


FT 


DISULFID 


174 


225 


By similarity. 


FT 


CARBOHYD 


222 


222 


N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


347 


347 


N-linked (GlcNAc. . .) (Potential). 


FT 


VARSPLIC 


356 


367 


NKKTLSDPNSHL -> M (in isoform 2) . 


FT 








/FTId=VSP 011698. 


FT 


VARIANT 


516 


516 


A -> T (in dbSNP: 10509332) . 


FT 








/FTId=VAR_0 19730. 


FT 


.^MUTAGEN 


412 


412 


D->N: Abolishes cleavage by caspase-3 


FT 








subsequent induction of apoptosis. 


FT 


CONFLICT 


483 


483 


K -> E (in Ref . 3) . 


FT 


CONFLICT 


851 


851 


L -> P (in Ref. 3; BAB14276) . 


SQ 


SEQUENCE 


945 AA; 


103637 


MW; 56064E335F323447 CRC64 ; 



Query Match 53.4%; Score 2558.5; DB 1; Length 945; 

Best Local Similarity 52.7%; Pred. No. 1.5e-183; 

Matches 501; Conservative 148; Mismatches 244; Indels 57; Gaps 15; 

Qy 1 MAVRPGLWPALLGIVLAAW LRGS GAQQ - S ATVAN P VP GAN P D L L PH FLVE P EDVYI V 56 

III 111:11 I : I I : : I I : I I : I I I I : I I I I 



Db 1 MGARSGARGALLLALLLCWDPRLSQAGTDSGSEVLPDSFPSAPAEPLPYFLQEPQDAYIV 60 



Qy 57 KNKPVLLVCKAVPATQI FFKCNGEWVRQVDHVT ERSTDGS SGLPTMEVRINVS RQQVEKV 116 

INN I 1:1 I I I I I : I I I I I I I I I Ml : | ::|| ||:| |||||||:: 
Db 61 KNKPVELRCRAFPATQIYFKCNGEWVSQNDHVTQEGLDEATGLRVREVQIEVSRQQVEEL 120 

Qy 117 FGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGI 176 

Ml 1:11 I I Ml I I 1:1 I I I 1:1 I I I I I I F : 1 i I I III |: ::| I II III: 

Db 121 FGLEDYWCQCVAWSSAGTTKSRRAYVRIAYLRKNFDQEPLGKEVPLDHEVLLQCRPPEGV 180 

Qy 177 PPAEVEWLRNEDLVDPSLDPNVYITREHSLVWQARLAD 236 

I I I I I I I : I I I : : I I : I I : | : I : I :: I I I I I : I I I II I I I I I I I I I : I II : I 
Db 181 PVAEVEWLKNEDVT DPTQDTNFLLTI DHNLI I RQARLS DTAN YTCVAKNI VAKRRSTTAT 240 

Qy 237 VIVYWGGWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCP 296 

1111111111:1 III II I I I I I I I |:|:| I II I M II I II I I I I Mill I : I I 
Db 241 VIVYVNGGWSSWAEWSPCSNRCGRGWQKRTRTCTNPAPLNGGAFCEGQAFQKTACTTICP 300 

Qy 297 VDGSWS PW SKWSACGLDCTHWRS RECS DPAPRNGGEECQGTDLDTRNCTS DLCVHS 352 

I M : I : II I I I I I : I I I I I I I I I I : I I I : I I I I I :: | | | I I : : 

Db 301 VDGAWTEWSKWSACSTECAPIWRSRECMAPPPQNGGRDCSGTLLDSKNCTDGLCMQNKKTL 360 

Qy 353 ASGPEDVALYVGL- 1 AVAVCLVLLLLVLI LVYCRKKEGLDS DVADS S - 1 LT 401 

III I III II :|: I : :|: I ::|| I |:|:, III II 

Db 361 SDPNSHLLEASG — DAALYAGLWAI FVWAILMAVGVWYRRNCRDFDTDITDSSAALT 418 

Qy 402 SGFQPVSIKPSKADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPL 459 

I I I I : I : : I I I I :: | | | : : | : | : | | : | : | | | | | | 
Db 419 GGFHPVNFKTARPSNPQLLHPSVPPDLTASAGIYRGPVYALQDS-TDKIPMTNSPLLDPL 477 

Qy 460 GGGRHTLHHSSPT SEAEEFVSRLSTQNY FRS L 491 

: : : I I I : : : : | | j | | 

Db 478 PSLKVKVYSSSTTGSGPGLADGADLLGVLPPGTYPSDFARDTHFLHLRSASLGSQQLLGL 537 

Qy 492 PRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLA-G 550 

II : Ml Mill II Ihlllrl II I : II II : II : : I I III: I 

Db 538 PRDPGSSVSGTFGCLGGRLSIPGTGVSLLVPNGAIPQGKFYEMYLLINKAEST-LPLSEG 596 

Qy 551 CQTLLSPIVSCGPPGVLLTRPVIIAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEA 610 

MMII I : II I I : II I I I I I I I I I I I :|| |: :| l|:|: I II 
Db 597 TQTVLSPSVTCGPTGLLLCRPVT LTMPHCAEVSARDWIFQLKTQAHQGHWEEWTLDEET 656 

Qy 611 P S HL Y YCQLEAS AC YVFT EQLGRFALVGEALS VAAAKRLKLLLFAPVACT S LE YN I RVYC 670 

: Mill II:: :||| : I I : I : I I I I : I : I I I lllll|::||M 
Db 657 LNT PC YCQLEP RACH I LLDQLGT YVFTGES YS RS AVKRLQLAVFAPALCT S LEYS LRVYC 716 

Qy 671 LHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQE 730 

I M II II I : : II : Ml I : : I I : I I I I I I I I I I I I : I I : I : I : It i I | | I 
Db 717 LEDTPVALKEVLELERTLGGYLVEEPKPLMFKDSYHNLRLSLHDLPHAHWRSKLLAKYQE 776 

Qy 731 IPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAE 790 

I II I I I I : I : I : M I II I I I I I : : : : I I I : I Nihil I : : : : I 
Db 777 IPFYHIWSGSQKALHCTFTLERHSLASTELTCKICVRQVEGEGQIFQLHTTLA-ETPAGS 835 

Qy 791 LLALESEAG — VPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSF 848 

I II I I : I I I I I I I I I II I : II I I I I I I I I I I I I : I : I : : 
Db 836 LDTLCSAPGSTVTTQLGPYAFKIPLSIRQKICNSLDAPNSRGNDWRMLAQKLSMDRYLNY 895 



Qy 849 FASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAEC 898 



Db 896 FATKASPTGVILDLWEALQQDDGDLNSLASALEEMGKSEMLVAVATDGDC 945 



RESULT 12 
UN5D MOUSE 



ID UN5D_MOUSE STANDARD; PRT; 956 AA. 

AC Q8K1S2; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5D precursor (Unc-5 homolog D) (Unc-5 horaolog 4) . 

GN Name=Unc5d; Synonyms=Unc5h4 ; 

OS Mus mus cuius (Mouse) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Rodentia; Sciurognathi ; Muridae; Murinae; Mus. 

OX NCBI_TaxI D= 1 0 0 9 0 ; 

RN [1] 

RP SEQUENCE FROM N.A. , AND TISSUE SPECIFIICTY. 

RX MEDLINE=22239710; PubMed=12351186; DOI=10 . 1016/S0925-4773 ( 02 ) 00248-4 ; 

RA Engelkamp D.; 

RT "Cloning of three mouse Unc5 genes and their expression patterns at 

RT mid-gestation . " ; 

RL Mech. Dev. 118:191-197(2002). 

CC -!- FUNCTION: Receptor for netrin involved in cell migration. May be 

CC involved in axon guidance by mediating axon repulsion of neuronal 

CC growth cones in the developing nervous system upon ligand binding. 

CC Axon repulsion in growth cones may be caused by its association 

CC with DCC that may trigger signaling for repulsion. It also acts as 

CC a dependence receptor required for apoptosis induction when not 

CC associated with netrin ligand (By similarity) . 

CC -!- SUBUNIT: Interacts with the cytoplasmic part of DCC (By 

CC similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Expressed in developing limb and mammary 

CC gland. 

CC PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 

CC cleavage does not take place when the receptor is associated with 

CC netrin ligand. Its cleavage by caspases is required to. induce 

CC apoptosis (By similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 

CC -!- SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

CC -!- SIMILARITY: Contains 1 ZU5 domain. 

CC 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 



cc 
cc 

DR 


or send an email 


to license@isb-sib.ch) . 


EMBL; AJ487854; CAD32252.1; 


DR 


HSSP; P07996; 1LSL. 


DR 


MGD; MGI 


2389364; 


Unc5d. 


DR 


InterPro, 


: IPR000488; Death. 


DR 


InterPrd 


IPR011029; DEATH like. 


DR 


InterPro, 


> IPR007110; Ig-like. 


DR 


InterPro, 


f IPR003598; Ig c2. 


DR 


InterPro, 


• IPR000884; TSP1. 


DR 


InterPro , 


\ IPR008085; TSP_1. 


DR 


InterPro , 


* IPR000906; ZU5. 


DR 


Pfam; PF00531; Death; 1. 


DR 


Pfam; PF00047; ig 


; 1. 


DR 


Pfam; PF00090; TSP_1; 2. 


DR 


Pfam; PF00791; ZU5; 1. 


DR 


PRINTS; PR01705; 


TSP1REPEAT. 


DR 


SMART; SM00005; DEATH; 1. 


DR 


SMART; SM00408; IGc2; 1. 


DR 


SMART; SM00209; TSP1; 2. 


DR 


PROSITE; 


PS50017; 


D EAT H_DOMAI N ; FALSE__NEG. 




PROSITE; 


PS50835; 


IG_LIKE; 1. 


DR 


PROSITE; 


PS50092; 


TSP1; 2. 


KW 
r\w 


Apoptosis; Developmental protein; Immunoglobulin domain; 


rvw 


Phosphorylation; 


Receptor; Repeat; Signal; Transmembrane. 


FT 


SIGNAL 


1 


30 Potential. 


FT 


CHAIN 


31 


956 Netrin receptor UNC5D. 


FT 


DOMAIN 


31 


382 Extracellular (Potential) . 




TRANS MEM 


383 


403 Potential. 


FT 


DOMAIN 


404 


956 Cytoplasmic (Potential) . 


PT 

r x 


DOMAIN 


52 


149 Ig-like. 


r ± 


DOMAIN 


151 


242 Ig-like C2-type. 


FT 


DOMAIN 


250 


304 TSP type-1 1. 


FT 


DOMAIN 


306 


358 TSP type-1 2. 


FT 


DOMAIN 


543 


645 ZU5. 


FT 


DOMAIN 


862 


939 Death. 


FT 


SITE 


419 


420 Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


706 


724 Interaction with DCC (By similarity) . 


FT 


DISULFID 


73 


132 By similarity. 


FT 


DISULFID 


178 


229 By similarity. 


FT 


CARBOHYD 


115 


115 N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


226 


226 N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


351 


351 N-linked (GlcNAc. . .) (Potential). 


FT 


CARBOHYD 


379 


379 N-linked (GlcNAc. . .) (Potential). 


SQ 


SEQUENCE 


956 AA; 106351 MW; DFDF07839C10C68D CRC64; 



Query Match 45.9%; Score 2200; DB 1; Length 956; 

Best Local Similarity 45.5%; Pred. No. 1.5e-156; 

Matches 431; Conservative 159; Mismatches 280; Indels 78; Gaps 15; 

Qy 8 WPALLGIVLAAWLRGS GAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLL 63 

I II: I I : II I : : : I I I I I I I : I I I I I I : I : I : I 
Db 15 WLPWLGLFF— WAAGAAAARGADGSEILPDSIPSA-PGTLPHFIEEPEDAYIIKSNPIAL 71 



Qy 

Db 



64 VCKAVPATQI FFKCNGEWVRQVDHVI ERSTDGSSGLPTMEVRINVSRQQVEKVFGLEEYW 123 

II I I I I I I I I I I I I I I I : I I I I I I I I I II I I I : I I I I I I 1:11 
72 RCKARPAMQIFFKCNGEWVBQNEHVSEESLDESSGLKVl^VFINvTRQQVEDFHGPEDYW 131 



Qy 124 CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPIiAKEVSLEQGIVLPCRPPEGIPPAEVEW 183 

I I I I I I I I I : I I : I I : I I I I I I I I I I : I : I I : I II I I I I I I I : I I I I I I 
Db 132 CQCVAWSHLGTSKSRKASVRIAYLRKNFEQDPQGREVPIEGMIVLHCRPPEGVPAAEVEW 191 

Qy 184 LRNEDLVT)PSLDPNWITREHSLVWQAR 243 

I : I I : : I I I : : I : I : : I I I I I : I : I I I I : I I I I I : I I I II I : I I I I I 
Db 192 LKNEEPIDSEQDENIDTRADHNLIIRQARLSDSGNYTCMAANIVAKRRSLSATVWYVNG 251 

Qy 244 GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSP 303 

111:11111 I : I I I I I I I I I I : I I I I II I I I I I I I I I : I I I I I I I I I I I I 
Db 252 GWSSWTEWSACNVRCGRGWQKRSRTCTNPAPLNGGAFCEGMSVQKITCTALCPVDGSWEV 311 

Qy 304 WSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV 350 

I I : I I I : I I I I I I : I I I I II : I : I :: I I I I I : 
Db 312 WSEWSVCSPECEHLRIRECTAPPPRNGGKFCEGLSQESENCTDGLCILDKKPLHEIKPQR 371 

Qy 351 H SAS GP EDVALYVGL I AVAVCLVLLLLVLI LVYCRKKEGLDS DVADS S I LTS GFQPV 4 07 

I : I I I I I II I : I : : : : I | I I II I I I I I I 

Db 372 WS RRG I EN AS D I AL YS GL - GAAWAVAVLVT GVT L YRRS H S D YGVDVT DS S ALTGGFQT F 430 

Qy 408 SIKPSKADNPHLL — TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG 460 

: I : I II : I I I I I : I I I : I III I : I I : I I 
Db 431 NFKTVRQGNSLLLNPAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLFNPLSDIKVK 487 

Qy 461 GGRH TLHHSSPTSEAEEFVSRLSTQNYFR 489 

II I : I : I :: I I 
Db 488 VQSSFMVSLGVSERAEYHGKNHSGTFPHGNNRGFSTIHPRNKT PYIQNLS 537 

Qy 490 SLPRGTSNMTYGTFNFLGGRLMI PNTGI SLLI PPDAI PRGKI YEI YLTLHKPEDVRLPLA 549 

III I III I I I II :: I I I I : I I I I I III : I I I : : : : : I I 
Db 538 SLPTRTELRTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSD 596 

Qy 550 GCQTLLSPIVSCGPPGVLLTRPVI LAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEE 609 

I : I I I I I : I I I I : I : I I I : II : I : I : : III:: : I I I : I : : : I 
Db 597 GS EVLLS P EVTCGP P DMLVTT P FALT I PHCADVS S EHWNI HLKKRTQQGKWEEVMS VEDE 656 

Qy 610 APSHLYYCQLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVY 669 

: I III: I I : I : I : I I I I : : I I : I I : : I : : I I I : I I : I I I 
Db 657 STS — CYCLLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLDYNLRVY 714 

Qy 670 CLHDTHDALKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQ 729 

I : : I I : I I : I : I I I I : : I I : : I I I I : : I : : I : I : I I I : I : I 
Db 715 CVDNTPCAFQEVTSDERHQGGQLLEEPKLLHFKGNTFSLQVSVLDIPPFLWRIKPFTACQ 774 

Qy 730 EIPFYHIWNGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFA 789 

I : I I : I : : : I I I I : I I I : I : I : I : I I : : I : : I I : : I : I 
Db 775 EVPFSRVWSSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESERET 834 

Qy 790 ELLALESEAGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFF 849 

: II I I I I I I I : II I : I :: I I :| II: MM :: :||:| 

Db 835 ITFFAQEDSTFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKDWQMLAQKNSINRNLSYF 894 

Qy 850 ASKPSPTAMILNLWEARHFPNGNLSQLAAAVAGLGQPDAGLFTVSEAE 897 

I :: I I : I : I II I II I I I : I : I III: : I : I : : I : 
Db 895 ATQSSPSAVI LNLWEARHQQDGDLDSLACALEEIGRTHTKLSNITEPQ 942 



RESULT 13 
UN5D HUMAN 



ID UN5D_HUMAN STANDARD; PRT; 953 AA. 

AC Q6UXZ4; Q8WYP7; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-OCT-2004 (Rel. 45, Last annotation update) 

DE Netrin receptor UNC5D precursor (Unc-5 homolog D) (Unc-5 homolog 4) 

DE (UNQ6012/PRO34692) . 

GN Name=UNC5D; Synonyms=KIAA1777, UNC5H4; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 

OC Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP SEQUENCE FROM N.A. (ISOFORM 1). 

RX MEDLINE=22887296; PubMed=12975309; DOI=10 . 1101/gr . 1293003 ; 

RA Clark H.F., Gurney A.L., Abaya E. , Baker K. , Baldwin D., Brush J., 

RA Chen J., Chow B., Chui C, Crowley C, Currell B. , Deuel B., Dowd P., 

RA Eaton D., Foster J., Grimaldi C, Gu Q. , Hass P.E., Heldens S., 

RA Huang A., Kim H.S., Klimowski L., Jin Y. , Johnson S., Lee J., 

RA Lewis L., Liao D., Mark M. , Robbie E., Sanchez C, Schoenfeld J., 

RA Seshagiri S., Simmons L., Singh J., Smith V., Stinson J., Vagts A., 

RA Vandlen R., Watanabe C, Wieand D., Woods K., Xie M.-H., Yansura D., 

RA Yi S., Yu G., Yuan J., Zhang M. , Zhang Z., Goddard A., Wood W.I., 

RA Godowski P., Gray A.; 

RT "The secreted protein discovery initiative (SPDI), a large-scale 

RT effort to identify novel human secreted and transmembrane proteins: a 

RT bioinf ormatics assessment . " ; 

RL Genome Res. 13:2265-2270(2003). 

RN [2] 

RP SEQUENCE FROM N.A. (ISOFORM 2). 

RC TISSUE=Brain; 

RA Nakajima D., Nakayama M., Nagase T., Ohara O.; 

RT "Identification of unc5H4 gene."; 

RL Submitted (JAN-2001) to the EMBL/ GenBank/DDBJ databases. 

CC -!- FUNCTION: Receptor for netrin. May be involved in axon guidance by 

CC mediating axon repulsion of neuronal growth cones in the 

CC developing nervous system upon ligand binding. Axon repulsion in 

CC growth cones may be caused by its association with DCC that may 

CC trigger signaling for repulsion. It also acts as a dependence 

CC receptor required for apoptosis induction when not associated with 

CC netrin ligand (By similarity) . 

CC SUBUNIT: Interacts with the cytoplasmic part of DCC (By 

CC . . similarity) . 

CC -!- SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q6UXZ4-l; Sequence=Displayed; 

CC Name=2 ; 

CC IsoId=Q6UXZ4-2; Sequence=VSP_011703 ; 

CC Note=No experimental confirmation available; 

CC -!- PTM: Phosphorylated on cytoplasmic tyrosine residues (By 

CC similarity) . 

CC -!- PTM: Proteolytically cleaved by caspases during apoptosis. The 



cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 
cc 

DR 
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DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
DR 
KW 
KW 



cleavage does not take place when the receptor is associated with 

netrin ligand. Its cleavage by caspases is required to induce 

apoptosis (By similarity) . 

SIMILARITY: Belongs to the UNC-5 family. 

SIMILARITY: Contains 1 death domain. 

SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 
SIMILARITY: Contains 1 immunoglobulin-like domain. 
SIMILARITY: Contains 2 TSP type-1 domains. 
SIMILARITY: Contains 1 ZU5 domain. 



This SWISS-PROT entry is copyright. It is produced through a collaboration 
between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 
the European Bioinf ormatics Institute. There are no restrictions on its 
use by non-profit institutions as long as its content is in no way 
modified and this statement is not removed. Usage by and for commercial 
entities requires a license agreement (See http://www.isb-sib.ch/announce/ 
or send an email to license@isb-sib . ch) . 



EMBL; AB055056; BAB83663.1; -. 
EMBL; AY358147; AAQ88514.1; 
HSSP; P07996; 1LSL. 
Genew; HGNC: 18634; UNC5D. 



InterPro; IPR000488 
InterPro; IPR011029 



InterPro; 
InterPro; 
InterPro; 
InterPro; 
InterPro; 



Death. 
DEATH_like. 
Ig-like. 
Ig_c2 . 
TSP1. 
TSP_1. 
ZU5. 
1. 



IPR007110; 
IPR003598; 
IPR000884; 
IPR008085; 
IPR000906; 
Pfam; PF00531; Death; 
Pfam; PF00047; ig; 1. 
Pfam; PF00090; TSP_1; 2. 
Pfam; PF00791; ZU5; 1. 
PRINTS; PR01705; TSP1REPEAT. 
SMART; SM00005; DEATH; 1. 
SMART; SM00408; IGc2; 1. 
SMART; SM00209; TSP1; 2. 

•PROSITE; PS50017; DEATH_DOMAIN; FALSE_NEG. 
PROSITE; PS50835; IG_LIKE; 1. 
PROSITE; PS50092; TSP1; 2. 

Alternative splicing; Apoptosis; Developmental protein; 
Immunoglobulin domain; Phosphorylation; Receptor; Repeat; Signal; 



KW 


Transmembrane . 






FT 


SIGNAL 


1 


32 


Potential. 


FT 


CHAIN 


33 


953 


Netrin receptor UNC5D. 


FT 


DOMAIN 


33 


379 


Extracellular (Potential) . 


FT 


TRANS MEM 


380 


400 


Potential . 


FT 


DOMAIN 


401 


953 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


54 


151 


Ig-like . 


FT 


DOMAIN 


153 


244 


Ig-like C2-type. 


FT 


DOMAIN 


252 


306 


TSP type-1 1. 


FT 


DOMAIN 


308 


360 


TSP type-1 2. 


FT 


DOMAIN 


540 


642 


ZU5. 


FT 


DOMAIN 


859 


936 


Death. 


FT 


SITE 


416 


417 


Cleavage (by caspase-3) (By similarity) . 


FT 


SITE 


703 


721 


Interaction with DCC (By similarity) . 


FT 


DISULFID 


75 


134 


By similarity. 



U L, |' 

£ 1 


nT c ?TIT.F*Tn 


1 RD 


2^1 


Dy oXIlLXXclxXuy • 


C"P 

r x 


fARROHYTl 


117 

XX/ 


117 

XX/ 


in xinKeu i^xcin/vc. • • j i ronentiax; . 


FT 


fARROHYD 


228 

c. u 


228 


in xj.njs.ea. \oxciN/\c. ■ •) (rotcnuiai / . 


FT 


PADAriHvn 






w xxnKea \uxcin/\c. . . ; (rotentiai) . 


FT 


r*ADAr>wvn 
c/\Kr)^jn i u 


^7£ 


^76 


im— xinjcea (bicwAc. . . ; (rotentiai) . 


C 1 


VARCjPT.Tr 

V/T-TS.O JT Xi X \^ 


1 
X 




l v lorvr\M/\x/\oouljo/\KKW XjJrVVxjoxjL.r Wr\AVji>\A/\K -> JMXxj 


FT 








VLVKALSDVCAGTSGFLLDFSSQTSP (in isoform 


FT 








2) . 


FT 








/FTId=VSP 011703. 


SQ 


SEQUENCE 


953 AA; 


105879 


MW; 5F893B9DF746F731 CRC64; 



Query Match 45.9%; Score 2199.5; DB 1; Length 953; 

Best Local Similarity 45.5%; Pred. No. 1.6e-156; 

Matches 428; Conservative 161; Mismatches 284; Indels 67; Gaps 14; 



Qy 


8 


WPALLGIVLAAWLRGS GAQQSATVANPVPGANPDLLPHFLVEPEDVYIVKNKPVLL 

1 II 1 1 1 : 1 : : 1 1 1 1 1 1 1 : 1 1 : 1 1 1 : 1 : 1 : 1 
WLPWLG — LCFWAAGTAAARGTDNGEALPESIPSA-PGTLPHFIEEPDDAYIIKSNPIAL 


63 


Db 


17 


73 


Qy 


64 


VCKAWATQI FFKCNGEWVl*QVT>Hv^ 

III II 1 1 1 1 1 II 1 1 1 1 1 : 1 1 1 : 1 1 1 1 1 II llhlllll 1 1 : 1 1 
RCKARPAMQIFFKCNGEWV1HQNEHVSEETLDESSGLKVT*EW^ 


123 


Db 


74 


133 


Qy 

Db 


124 
134 


CQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPLAKEVSLEQGIVLPCRPPEGIPPAEVEW 
lllllll ll:||:|| :IM 1111111:1 :|| :| 1 II 1 1 1 1 I 1 : I 1 1 1 1 1 
CQCVAWSHLGTSKSRKASWIAYLRKNFEQDPQGREVPIEC^IVXHCRPPEGVPAAEVEW 


183 
193 


Qy 


184 


LRNEDLVD P S LDPNVYITREHS LWRQARLADTAN YT CVAKNI VARRRSASAAVTVYVNG 
1:11: :| 1 |: : 1 : 1 : : 1 1 1 1 1 : 1 : III |:i lllhlll II |:||||| 
LKNEEP I DS EQDEN I DTRADHNLI I RQARLS D S GN YT CMAANI VAKRRS LS ATWVYVNG 


243 


Db 


194 


253 


Qy 


244 


GWSTWTEWSVCSASCGRGWQKRSRSCTNPAPLNGGAFCEGQNVQKTACATLCPVDGSWSP 
llhlllll 1 : 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 : 1 1 1 1 : 1 1 1 1 1 1 1 1 
GWSSWTEWSACNVRCGRGWQKRSRTCTNPAPLNGGAFCEGMSVQKITCTSLCPVDGSWEV 


303 


Db 


254 


313 


Qy 


304 


WSKWSACGLDCTHWRSRECSDPAPRNGGEECQGTDLDTRNCTSDLCV HS 

Ihlll : 1 1 1 1 1 1 : 1 1 1 1 1 1 : 1 : 1 : : 1 1 1 1 1 : I 
WSEWSVCSPECEHLRIRECTAPPPRNGGKFCEGLSQESENCTDGLCILDKKPLHEIKPQS 


352 


Db 


314 


373 


Qy 


353 


ASGPEDVALWGLIAVAVCLVXLLLVLILVYCRKKEGLDSDVADSSILTSGFQPVSIKPS 

1 : 1 1 1 1 1 II I : I : : : : | | : 1 1 1 1 1 1 1 1 1 1 : 1 
I EN AS D I AL Y S GL- GAAWAVAVLVI GVT L YRRS Q S D YGVDVT DS S ALT GG FQT FN FKTV 


412 


Db 


374 


432 


Qy 


413 


KADNPHLL— TIQPDLSTTTTTYQGSLCPRQDGPSPKFQLTNGHLLSPLG 

: 1 II : 1 1 1 1 1 : 1 1 1 : 1 III 1 : 1 1 : 1 1 
RQGNSLLLNSAMQPDL-TVSRTYSGPIC-LQD-PLDKELMTESSLFNPLSDIKVKVQSSF 


460 


Db 


433 


489 


Qy 


461 


GGRHTLHHSSPTSEAEEFVSRLSTQNYFRSLPRGTSN 

II: 1 : : : : 1 1 1 1 1 1 
MVSLGVSERAEYHGKNHSRTFPHGNNHSFSTMHPRNKM-PYIQNLS SLPTRTEL 


497 


Db 


490 


542 


Qy 


498 


MTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIYLTLHKPEDVRLPLAGCQTLLSP 
III 1 1 1 1 I : : 1 1 1 1 : 1 1 1 1 1 III : I | | : :::: | | 1 : 1 1 1 1 
RTTGVFGHLGGRLVMPNTGVSLLIPHGAIPEENSWEIYMSINQGEP-SLQSDGSEVLLSP 


557 


Db 


543 


601 



Qy 558 IVSCGPPGVLLTRPVILAMDHCGEPSPDSWSLRLKKQSCEGSWEDVLHLGEEAPSHLYYC 617 

I : I I I I : : : I I I : I I : I : I : : I I I : : : I I I : I : : : I : I II 



Db 602 EVTCGPPDMIWTPFALTIPHCADVSSEHWNIHLKKRTQQGKWEEVMSVEDESTS — CYC 659 



Qy 618 QLEASACYVFTEQLGRFALVGEALSVAAAKRLKLLLFAPVACTSLEYNIRVYCLHDTHDA 677 

I : I I : I : I : I I I I : : I I : I I : : I : : ! I I : I I : I I I I : : I I 
Db 660 LLDPFACHVLLDSFGTYALTGEPITDCAVKQLKVAVFGCMSCNSLDYNLRVYCVDNTPCA 719 

Qy 678 LKEWQLEKQLGGQLIQEPRVLHFKDSYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIW 737 

: I I I I : I I II : : I I : : I I I I : : I : :| : I : I I I : I : I I : I I : I 
Db 720 FQEWSDERHQGGQLLEEPKLLHFKGNTFSLQISVLDIPPFLWRIKPFTACQEVPFSRVW 779 

Qy 738 NGTQRYLHCTFTLERVSPSTSDLACKLWVWQVEGDGQSFSINFNITKDTRFAELLALESE 797 

: : I I I I : I I I : I : I : I : I I : : I : : I I : : I : I : : 

Db 780 CSNRQPLHCAFSLERYTPTTTQLSCKICIRQLKGHEQILQVQTSILESERETITFFAQED 839 

Qy 798 AGVPALVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTA 857 

: II I I I I I I I : I I I : I :: I I : I I I : I I I I : : : I I : I I : : | | : | 
Db 840 STFPAQTGPKAFKIPYSIRQRICATFDTPNAKGKDWQMLAQKNSINRNLSYFATQSSPSA 899 

Qy 858 MI LNLWEARH FPNGNL SQLAAAVAGLQQ P DAGL FTVS EAE 897 

: I I I I I I I II : I : I III: : I : I : I I : : 
Db 900 VTLNLWEARHQHDGDLDSLACALEEIGRTHTKLSNISESQ 939 



RESULT 14 
Q7PW78 

ID Q7PW78 PRELIMINARY; PRT; 876 AA. 

AC Q7PW78; 

DT 01-MAR-2004 (TrEMBLrel. 26, Created) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last sequence update) 

DT 01-MAR-2004 (TrEMBLrel. 26, Last annotation update) 

DE ENSANGP00000005212 (Fragment) . 

GN Name=ENSANGG00000004014; 

OS Anopheles gambiae str. PEST. 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota; Diptera; Nematocera; Culicoidea; Anopheles. 

OX NCBI_TaxI D=l 8 0 4 5 4 ; 

RN [1] 

RP SEQUENCE FROM N.A. 

RC STRAIN=PEST; 

RA Anopheles Genome Sequencing Consortium; 

RL Submitted (APR-2003) to the EMBL/GenBank/DDBJ databases. 

CC -!- CAUTION: The sequence shown here is derived from an 

CC EMBL/GenBank/DDBJ whole genome shotgun (WGS) entry which is 

CC preliminary data. 

DR . EMBL; AAAB01008984 ; EAA14755.1; -. 

DR GO; GO: 0005515; F:protein binding; IEA. 

DR GO; GO: 0007165; P: signal transduction; IEA. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like . 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR000884; TSP1. 

DR InterPro; IPR000906; ZU5 . 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00090; TSP_1; 2. 

DR Pfam; PF00791; ZU5; 1. 

DR PROSITE; PS50017; DEATH JDOMAIN; 1. 

DR PROSITE; PS50835; IG LIKE; 1. 



DR PROSITE; PS50092; TSP1; 2. 

FT NON_TER 1 1 

FT NONJTER 876 876 

SQ SEQUENCE 876 AA; 96301 MW; 6F9336D53E096E00 CRC64; 

Query Match 21.9%; Score 1050.5; DB 2; Length 876; 

Best Local Similarity 30.6%; Pred. No. 5.1e-70; 

Matches 284; Conservative 138; Mismatches 368; Indels 137; Gaps 30; 

Qy 43 LPHFLv^PEDWIVT^NKPVT.LVCKAVPATQIFFKCN 102 

I I I I I I I : I : : I I : I I I I I I I I I I I : I : I II I I : 

Db 2 LPVFLVEPKGAYVMKNRPAKLYCKASHALQISFKCSGS-TKPPPTEKEHHTDPHSGVQLQ 60 

Qy 103 EVRIN^SRQQV^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEPIJVKEVSL 162 

I ::|: I:: || : |:| |:|| | |:| |::| ::| | |: 

Db 61 EATATITRELVDEFFGKGPFKCECRAYSSRGHVKTQPVTIQVATIKKQISISPKIVRVAT 120 

Qy 163 EQGIVXPCRPPEGIPPAEVTSWLRNEDLVDPSLDPNVYITREH 222 

II I I : I I I : I I : I : I : : I : : : : I I I I I I I 

Db 121 GGRAELNC-IANATPAAKWWLKNS — VPVHANPPFVXLTENALLIARVEIQDMANYTCV 177 

Qy 223 AKNIVARRRSASAAVIVYWGGWSTWTEWSVCSA-SCGRGWQKRSRSCTNPAPLNGGAFC 281 

I : I I : I I : I I I : I I I I : I I : I I : I I I : I I : I I : I I I I 

Db 178 AENIAGKRVSDPVPITVYVDGGWSSWGPWTDCKCPGHGKQGQKRTRVCNSPVPMNNGAPC 237 

Qy 282 EGQNVQKTACATLCPVDGSWSPWSKWSACGLDCTHWRSRECSDPA-PRNGGE — ECQGTD 338 

:| : : I I I I I I I : I I I I I I I I I I I : I II 

Db 238 KGASTESTPDCLPCSA-GRWSSWSEWSECGPDCTQIRQRSCVAQAFTUIDSGTIVNCAGKS 296 

Qy 339 LDTRNCTSDLCVliSASGPE-DVALWGLIAVAVCLVLLLLVLILVYCRKKEGLDSDVADS 397 

: II I I : : I I I : I I : I I | | : : | : j : | : : : 

Db 297 QQS I KCTGGLCN YTAQDSNWS WLWVTLVA-AFCLGWFAV — SKFLRRKKTIPAYNLAR 353 

Qy 398 S I LTSGFQPVS I K- PSKAD-NPHLLTIQP DLSTTTTT 432 

III | :: : | | | | I I I I 

Db 354 SDLTQNTGPINYEYPMTASLQPHLAGHHPHHHHLQQQQHHHHLSSLHHQHGSSLLGPVAT 413 

Qy 433 YQGSL CPRQDGPSPKFQLTNGHLLSPLGGGRHTL HHSS 470 

II III: | : | | : : | | | 

Db 414 GHGQLHPQCQSQQAPT LPIGGLKSSLPLPRSNSEHHYDVPHLCNNTSPA 462 

Qy 471 — PTSEAEEFVSRLSTQNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPR 528 

I I : I I I : : I : I I I : : I I I I I I I : 

Db 463 IRPTTLVEE P FRGAEVTHAT LT PAGALLRLAT Y S TALLI P EGAI P K 508 

Qy 529 GKIYEIYLTLHKPE--DWLPIAGCQTLLSPIVSCGPPGVXLTRPVi:LAMDHCGEPSPDS 586 

: : : I : : : : I : I I I I I : I I I I : : I : : : : I I I | 

Db 509 HQRHSVALSIWDDKHHVPVPTGPRSTYLSPWFCGPVDTKVNKPIVMQLPHCAENLSD- 567 

Qy 587 WSLRLKKQSCEGS-WEDVXHLGEEAPSHLYYCQLEASACYVFTEQLGRFALVGEALS 642 

I : I : I I : : I I I : I :: II II I : : I I I I : : 

Db 568 WAFSLYSAPDNWPWCKWTIGEETLNTPALVQIDKRYAYVLTETFGKYVLVGESATDIQ 627 

Qy 643 -VAAAKRLKLLLFAPVACTSL-EYNIRVYCLHDTHDALKEWQLEKQLGGQLIQEPRVLH 700 

I I I I : I : I : : : M I : I I : I : :: I I I : III 

Db 628 ERVACKRLRLFICGPSTVPEFSDVSLRVYIVEDNPGAEERCRHCEQEIGGVLLGRSTVLH 687 



Qy 

Db 



701 FKD-SYHNLRLSIHDVPSSLWKSKLLVSYQEIPFYHIWNGTQRYLHCTFTLERVSPSTSD 759 

II I : : I I :: I I I I I I : I I I I I : I I I 
688 FADVGQAGLN I DLQCVGG — WRAKS S S ERQT I P FSHVWN S ACTALHCS FTL 736 



Qy 760 LACKLWVWQVEGDGQSFSINFNITKDTRFA ELLALESEAGVPA 802 

I : II II : : I I I : I I I I 
Db 737 — CR TEHDKCDFKIWQASQDVPQGLDERLTAI GVP ATL S I S S VGS GDHNT 785 

Qy 803 -LVGPSAFKIPFLIRQKIISSLDPPCRRGADWRTLAQKLHLDSHLSFFASKPSPTAMILN 861 

II I : : : :: I : MM:: Mill I : : I : I : : I I : : I I II I I : 
Db 786 NLVATDRFRLSKDVKRKLCRCLDPPTQKRNDWRMLAAHLNVDRYLTYFATRPSPTDQILD 845 

Qy 862 LWEARH FPN GN L S Q LAAAVAGLGQ PDA 888 

MM: Ml : : I II 

Db 846 LWECRNRDLNALQQLIEICRTMERPDA 872 



RESULT 15 
UNC5_DROME 

ID UNC5_DROME STANDARD; PRT; 1072 AA. 

AC Q95TU8; Q9NBL0; Q9V7B5; 

DT 25-OCT-2004 (Rel. 45, Created) 

DT 25-OCT-2004 (Rel. 45, Last sequence update) 

DT 25-JAN-2005 (Rel. 46, Last annotation update) 

DE Netrin receptor unc-5 precursor (Unc5 netrin receptor) . 

GN Name=unc-5; ORFNames=CG8166; 

OS Drosophila melanogaster (Fruit fly) . 

OC Eukaryota; Metazoa; Arthropoda; Hexapoda; Insecta; Pterygota; 

OC Neoptera; Endopterygota ; Diptera; Brachycera; Muscomorpha; 

OC Ephydroidea; Drosophilidae; Drosophila. 

OX NCBI_TaxID=7227 ; 

RN [1] 

RP SEQUENCE FROM N.A. , FUNCTION, AND TISSUE SPECIFICITY. 

RX PubMed=11719202; 

RA Keleman K. , Dickson B.J.; 

RT "Short- and long-range repulsion by the Drosophila Unc5 netrin 

RT receptor."; 

RL Neuron 32:605-617(2001). 

RN [2] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; 

RX MEDLINE=20196006; PubMed=10731132 ; DOI=10 . 1126/science. 287 . 5461 . 2185; 

RA Adams M.D., Celniker S.E., Holt R.A.., Evans C.A. , Gocayne J.D., 

RA Amanatides P.G., Scherer S.E., Li P.W., Hoskins R.A. , Galle R.F., 

RA George R.A. , Lewis S.E., Richards S., Ashburner M. , Henderson S.N., 

RA Sutton G. G. , . Wortman J.R., Yandell M.D., Zhang Q., Chen L.X., 

RA Brandon R.C., Rogers Y.-H.C, Blazej R.G., Champe M. , Pfeiffer B.D., 

RA Wan K.H., Doyle C. , Baxter E.G., Helt G. , Nelson C.R., Miklos G.L.G., 

RA Abril J.F., Agbayani A., An H.-J., Andrews-Pf annkoch C, Baldwin D., 

RA . Ballew R.M., Basu A., Baxendale J., Bayraktaroglu L., Beasley E.M., 

RA Beeson K.Y., Benos P.V., Berman B.P., Bhandari D., Bolshakov S., 

RA Borkova D. , Botchan M.R., Bouck J., Brokstein P., Brottier P., 

RA Burtis K.C., Busam D.A. , Butler H., Cadieu E., Center A., Chandra I., 

RA Cherry J.M. , Cawley S., Dahlke C, Davenport L.B., Da vies P., 

RA de Pablos B., Delcher A., Deng Z., Mays A.D., Dew I., Dietz S.M., 

RA Dodson K., Doup L.E., Dowries M. , Dugan-Rocha S., Dunkov B.C., Dunn P., 

RA Durbin K.J., Evangelista C.C., Ferraz C, Ferriera S., Fleischmann W., 



RA Fosler C, Gabrielian A.E., Garg N.S., Gelbart W.M., Glasser K. , 

RA Glodek A., Gong F. , Gorrell J.H., Gu Z., Guan P . , Harris M., 

RA Harris N.L., Harvey D.A., Heiman T.J., Hernandez J.R., Houck J. , 

RA Hostin D., Houston K.A. f Howland T.J., Wei M.-H., Ibegwam C, 

RA Jalali M. , Kalush F. , Karpen G.H., Ke Z., Kennison J. A. , Ketchum K.A. , 

RA Kimmel B.E., Kodira CD., Kraft C, Kravitz S., Kulp D., Lai Z., 

RA Lasko P., Lei Y., Levitsky A. A. , Li J.H., Li Z., Liang Y. , Lin X., 

RA Liu X., Mattei B., Mcintosh T.C., McLeod M.P., McPherson D., 

RA Merkulov G., Milshina N.V., Mobarry C, Morris J., Moshrefi A., 

RA Mount S.M., Moy. M. , Murphy B., Murphy L., Muzny D.M., Nelson D.L., 

RA Nelson D.R., Nelson K.A., Nixon K. , Nusskern D.R., Pacleb J.M., 

RA Palazzolo M., Pittman G.S., Pan S., Pollard J., Puri V. , Reese M.G., 

RA Reinert K., Remington K. , Saunders R.D.C., Scheeler F. , Shen H., 

RA Shue B.C., Siden-Kiamos I., Simpson M. , Skupski M.P., Smith T., 

RA Spier E. , Spradling A.C., Stapleton M. , Strong R. , Sun E., 

RA Svirskas R. , Tector C, Turner R., Venter E., Wang A.H., Wang X., 

RA Wang Z.-Y., Wassarman D.A., Weinstock G.M., Weissenbach J., 

RA Williams S.M., Woodage T., Worley K.C., Wu D., Yang S., Yao Q.A., 

RA Ye J., Yeh R.-F., Zaveri J.S., Zhan M. , Zhang G. , Zhao Q. , Zheng L., 

RA Zheng X.H., Zhong F.N., Zhong W., Zhou X., Zhu S., Zhu X., Smith H.O., 

RA Gibbs R.A. , Myers E.W., Rubin G.M., Venter J.C.; 

RT "The genome sequence of Drosophila melanogaster . "; 

RL Science 287:2185-2195(2000) . 

RN [3] 

RP GENOME REANNOTATION. 

RX MEDLINE=22426069; PubMed=12537572 ; 

RA Misra S., Crosby M.A., Mungall C.J. f Matthews B.B., Campbell K.S., 

RA Hradecky P., Huang Y., Kaminker J.S., Millburn G.H., Prochnik S.E., 

RA Smith CD., Tupy J.L., Whitfield E.J., Bayraktaroglu L., Berman B.P., 

RA Bettencourt B.R., Celniker S.E., de Grey A. D.N. J., Drysdale R.A. , 

RA Harris N.L., Richter J., Russo S., Schroeder A. J., Shu S.Q., 

RA Stapleton M. , Yamada C, Ashburner M. , Gelbart W.M., Rubin G.M., 

RA Lewis S.E.; 

RT "Annotation of the Drosophila melanogaster euchromatic genome: a 

RT systematic review."; 

RL Genome Biol. 3 : RESEARCH0083 . 1-RESEARCH0083 . 22 (2002) . 

RN [4] 

RP SEQUENCE FROM N.A. 

RC STRAIN=Berkeley; TISSUE=Embryo; 

RX MEDLINE=22426066; PubMed=12537569 ; 

RA Stapleton M. , Carlson J.W., Brokstein P., Yu C, Champe M. , 

RA George R.A. , Guarin H., Kronmiller B., Pacleb J.M., Park S., Wan K.H., 

RA Rubin G.M. , Celniker S.E. ; 

RT "A Drosophila full-length cDNA resource."; 

RL Genome Biol. 3 : RESEARCH0080 . 1-RESEARCH0080 . 8 (2002 ) . 

RN [5] 

RP FUNCTION. 

RX PubMed=12765609; 

RA Freeman M.R., Del row J., Kim J., Johnson E., Doe C.Q.; 

RT "Unwrapping glial biology: Gem target genes regulating glial 

RT development, diversification, and function."; 

RL Neuron 38:567-580 (2003) . 

CC -!- FUNCTION: Receptor for netrin required for motor axon guidance. 
CC Mediates both short- and long-range axon motor repulsion in the 

CC developing nervous system upon ligand binding. Also involved in 

CC glial migration. While short-range repulsion requires both fra and 

CC unc-5, long-range repulsion only requires unc-5. 



CC SUBCELLULAR LOCATION: Type I membrane protein (By similarity). 

CC -!- TISSUE SPECIFICITY: Prior to gastrulation, it is strongly 

CC expressed in the presumptive mesoderm. Mesodermal expression 

CC begins to fade during stages 13-14, persisting only in the cells 

CC that form the dorsal vessel. Expressed within the CNS from late 

CC stage 13, shortly after the first axons have extended. Detected in 

CC several dispersed clusters of cells within the CNS, increasing in 

CC number as development proceeds. Also expressed in the peripheral 

CC and exit glia, which migrate laterally out of the CNS between 

CC stages 14 and 17. Strongly expressed in motor axons that exit the 

CC CNS ipsilaterally via the segmental nerve root (SN) . Not expressed 

CC on either commissural or longitudinal axons within the CNS, nor on 

CC motor axons that exit via the intersegmental nerve (ISN) . In the 

CC periphery, it is detected on all branches of the SN. Also 

CC expressed at high level in exit and peripheral glia along both the 

CC SN and ISN. 

CC -!- PTM: Phosphorylated on different cytoplasmic tyrosine residues (By 
CC similarity) . 

CC -!- SIMILARITY: Belongs to the UNC-5 family. 
CC SIMILARITY: Contains 1 death domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin-like C2-type domain. 

CC -!- SIMILARITY: Contains 1 immunoglobulin- like domain. 

CC -!- SIMILARITY: Contains 2 TSP type-1 domains. 

cc 

CC This SWISS-PROT entry is copyright. It is produced through a collaboration 

CC between the Swiss Institute of Bioinf ormatics and the EMBL outstation - 

CC the European Bioinf ormatics Institute. There are no restrictions on its 

CC use by non-profit institutions as long as its content is in no way 

CC modified and this statement is not removed. Usage by and for commercial 

CC entities requires a license agreement (See http://www.isb-sib.ch/announce/ 

CC or send an email to license@isb-sib . ch) . 

CC 

DR EMBL; AF247762; AAF74193.1; -. 

DR EMBL; AE003811; AAF58143.2; 

DR EMBL; AY058501; AAL13730.1; -. 

DR HSSP; P07996; 1LSL. 

DR FlyBase; FBgn0034013; unc-5. 

DR GO; GO: 0007411; P:axon guidance; IGI. 

DR GO; GO: 0008347; P:glia cell migration; IMP. 

DR InterPro; IPR000488; Death. 

DR InterPro; IPR011029; DEATH_like. 

DR InterPro; IPR007110; Ig-like. 

DR InterPro; IPR003598; Ig_c2. 

DR InterPro; IPR000884; TSP1. 

DR Pfam; PF00531; Death; 1. 

DR Pfam; PF00047; ig; 1. 

DR SMART; SM00005; DEATH; 1. 

DR SMART; SM00408; IGc2; 1. 

DR SMART; SM00209; TSP1; 2. 

DR PROSITE; PS50017; DEATH_DOMAIN ; 1. 

DR PROSITE; PS50835; IG_LIKE; 1. 

DR PROSITE; PS50092; TSP1; 2. 

KW Developmental protein; Immunoglobulin domain; Phosphorylation; 

KW Receptor; Repeat; Signal; Transmembrane. 

FT SIGNAL 1 30 Potential. 

FT CHAIN 31 1072 Netrin receptor unc-5. 

FT DOMAIN 31 440 Extracellular (Potential). 



FT 


1 KAN oMr^M 


A A 1 


A £1 
fi O J. 


rotenciai . 


FT 


DOMAIN 


A £0 




Cytoplasmic (Potential) . 


FT 


T"\/VKVr A TXT 

DOMAIN 


1 O Q 




ig-iiJce . 


FT 


DOMAIN 


O'XO 


ol ft 


ig-iiKe L,z-uype. 


FT 


nriMaTM 

DUlYLttJ.JN 




OTQ 

o / y 


ior uype— 1 1, 


FT 


r\^MV>f7V TXT 

DOMAIN 


ooo 
or o 


499 


TSP type-1 2. 


FT 


T\/>X/TA TXT 

DOMAIN 


you 


i n c *7 
lUo / 


Death. 


FT 


HT OTTT TPT T\ 

DIoULe ID 


1 A Q 

149 


o m 
/ 


By similarity. 


FT 


r\T OTTT TJ»T T» 

DlbUlir ID 




o n "0 


By similarity. 


FT 


CARBOHYD 


79 


79 


N-linked (GlcNAc. . .) (Potential) 


FT 


CARBOHYD 


300 


300 


N-linked (GlcNAc. . .) (Potential) 


FT 


CONFLICT 


885 


885 


P -> S (in Ref . 1) . 


SQ 


SEQUENCE 


1072 


AA; 116416 


MW; A0A6B5A96B10138F CRC64; 



Query Match 20.7%; Score 992; DB 1; Length 1072; 

Best Local Similarity 28.7%; Pred. No. 1.7e-65; 

Matches 282; Conservative 145; Mismatches 383; Indels 174; Gaps 33; 

Qy 36 PGANPDLLPHFLVTCPEDW I VlOtfKPVLLVCKAV ERSTDG 95 

II : II 11:111 |::lil:| : I I I I : I : I I I : I : I I 

Db 120 PGEASNTLPIFLIEPESVFVVKNRPAVLKCKASHSLQVT FKCSGS-SQPPPSTHETHVDP 178 

Qy 96 SSGLPTMEWINVSRQQV^KVFGLEEYWCQCVAWSSSGTTKSQKAYIRIARLRKNFEQEP 155 

: I : II : I I :: I I : I : I I I I I I I I I I : I I : I I : I I I 
Db 179 HTGV1^EEWATIHRDLV1)EFFGDGPFKCECHAWSSRGVVTCSQAATVHIAYIRKSFNQSP 238 

Qy 156 I^EVSLEQGIVXPCRPPEGIPPAEV^WLRNEDLvT)PSLDPNW 215 

: : I I I II I I : : I : I : : : I : :: : I : I I I 

Db 239 TSLRLELGSR7VELRCEPPGGFPEPKLTWHKNNAVITADSEPGITVS-AGTLIFRQVALQH 297 

Qy 216 TANYTCVAKNIVARRRSASAAVTVYVNGGWSTWTEWSVCSA SCGRGWQKRSRSCTN 271 

111:1 1:11 I I I I I :| I I M II I i I E: I I I II Nihil 
Db 298 MANYSCSAENIAGRRVSDSAVXIVTVNGGWSTWSPWRECKCAGKPSQGR KRSRTCNN 354 

Qy 272 PAP LNGGAFC EGQNVQKT ACAT LC P VD GSWSPWSKWSACGLDC 314 

I I I I I I I I I : I I : I III M II II I : I 

Db 355 PMPLNGGAQCPGPQIQKSADCAACPEDTQIVSPDGFDISSSKRMARWSAWSDWSICSAEC 414 

Qy 315 THWRSREC SDP 325 

I 1:1 II 
Db 415 I QVRRRKC LTQGQTQ I S S EAEEAGDL LLGAP GVGMAAL I AAAGVGAVGS P S EATG S S S D I 474 

Qy 326 APRNGGEECQGTDLDTRNCTSDLCVHSASGPEDV — ALYVGLIAV-AVCLVLLLLVLILV 382 

II I I I : I I : I I : I I I : I I : I I I : : 
Db 475 I PGYGKSLCAGKDIQTAECRGEQC QIGKDDFDWTLYLGLAFITAVCFAFGTALI 528 

Qy 383 YCRKKEGLDSDV ADSSILTSGFQPVSIKPSKADNPHLLTIQPDLSTTTTTYQGSLCP 439 

I : I : : : | : : : : | : | | : : | | 
Db 529 -CCARRGIRTNPHYNMARSVMDADYMP-GWEKKEMRMHI EASNMGYDYV 576 

Qy 440 RQDGPS PKFQLTNGHLLS- PLGGGRHTLHH SSPTSEAEE 477 

I : : I I : : : I I I I I III: 
Db 577 NPGHRY-LPGEHIMGMGIGCGGVTEHHYDVPNLSANYTNPIDHLSVDYLSETGESST 632 



Qy 

Db 



478 FVSRLST — QNYFRSLPRGTSNMTYGTFNFLGGRLMIPNTGISLLIPPDAIPRGKIYEIY 535 

: I I I I : : : I I I I : I : : I : I I I : : 

633 ADTSNSTFDMNGKLSILNASKSSTYEMLGSAGGQLRLYGGELLLFVPEHAIGKHVKKHVS 692 



Qy 


536 


LTLHKPEDVRLPLAGCQT LLSPIVSCGPPGVLLTRPVTLAMDHCGEPSPDSWSLRL 

II II: II 1 1 : 1 1 : 1 1 1 1 : I I : I : I : : 
LLLLSDECSRV SCATESSILCSSWHSAPRNYSFVKPVILKIPHC-LVAPEQWHVHI 
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